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CIiONZNG OF CYTOCHROME P450 GENES FROM NICOTIAMA. 

The present invention relates to nucleic acid sequences 
encoding cytochrome p450 enzymes (hereinafter referred to as 
5 p450 and p450 enzymes) in ISflcotiana plants and methods for 

using those nucleic acid sequences to alter plant pheno types. 

BACKGROUND 

10 Cytochrome p450s catalyze enzymatic reactions for a 

diverse range of chemically dissimilar substrates that include 
the oxidative, peroxidative and reductive metabolism of 
endogenous and xenobiotic substrates. In plants, p450s 
participate in biochemical pathways that include the synthesis 

15 of plant products such as phenylpropanoids , alkaloids, 

terpenoids, lipids, cyanogenic glycosides, and glucosinolates 
(Chappel, Annu. Rev. Plant Physiol. Plant Mol . Biol. 198, 
49:311-343). Cytochrome p450s, also known as p450 heme- 
thiolate proteins, usually act as terminal oxidases in multi- 

20 component electron transfer chains, called p45 0- containing 

monooxygenase systems. Specific reactions catalyzed include 
demethylation, hydroxylation, epoxidation, N-oxidation, 
sulf ooxxdation, N-, S~, and O- dealkylations, desulfation, 
deamination, and reduction of azo, nitro, and N-oxide groups. 

25 

The diverse role of Nicotiana plant p450 enzymes has been 
implicated in effecting a variety of plant metabolites such as 
phenylpropanoids, alkaloids, terpenoids, lipids, cyanogenic 
glycosides, glucosinolates and a host of other chemical 
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entities. During recent years, it is becoming apparent that 
some p450 enzymes can impact the composition of plant 
metabolites in plants. For example, it has been long desired 
to improve the flavor and aroma of certain plants by altering 
its profile of selected fatty acids through breeding; however 
very little is known about mechanisms involved in controlling 
the levels of these leaf constituents. The down regulation of 
p450 enzymes associated with the modification of fatty acids 
may facilitate accumulation of desired fatty acids that 
provide more preferred leaf phenotypic qualities. The 
function of p450 enzymes and their broadening roles in plant 
constituents is still being discovered. For instance, a 
special class of p450 enzymes was found to catalyze the 
breakdown of fatty acid into volatile C6- and C9-aldehydes and 
-alcohols that are major contributors of ''fresh green" odor of 
fruits and vegetables. The level of other novel targeted p450s 
may be altered to enhance the qualities of leaf constituents 
by modifying lipid composition and related break down 
metabolites in Nicotiana leaf. Several of these constituents 
in leaf are affected by senescence that stimulates the 
maturation of leaf quality properties. Still other reports 
have shown that p450s enzymes are play a functional role in 
altering fatty acids that are involved in plant -pathogen 
interactions and disease resistance. 

In other instances, p450 enzymes have been suggested to 
be involved in alkaloid biosynthesis. Nornicotine is a minor 
alkaloid found in Nicotiana tabaceum. It has been postulated 
that it is produced by the p450 mediated demethylation of 
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nicotine followed by acylation and nitrosation at the N 
position thereby producing a series of N-acylnonicotines and 
N-ni trosonornico tines - N-demethylation, catalyzed by a 
putative p450 deitiethylase, is thought to be a primary source 
of nornicotine biosyntheses in Nicotlana. While the enzyme is 
believed to be microsoiual , thus far a nicotine demethylase 
enzyme has not been successfully purified, nor have the genes 
involved been isolated. 

Furthermore, it is hypothesized but not proven that the 
activity of p450 enzymes is genetically controlled and also 
strongly influenced by environment factors. For example, the 
demethylation of nicotine in Nlcotiana is thought to increase 
substantially when the plants reach a mature stage. 
Furthermore, it is hypothesized yet not proven that the 
demethylase gene contains a transposable element that can 
inhibit translation of RNA when present. 

The large multiplicity of p450 enzyme forms, their 
differing structure and function have made their research on 
Nicotiana p450 enzymes very difficult before the enclosed 
invention. In addition^ cloning of p450 enzymes has been 
hampered at least in part because these membrane-localized 
proteins are typically present in low abundance and often 
unstable to purification. Hence, a need exists for the 
identification of p450 enzymes in plants and the nucleic acid 
sequences associated with those p450 enzymes. In particular, 
only a few cytochrome p450 proteins have been reported in 
Nicotiana. The inventions described herein entail the 
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discovery of a substantial ntimber of cytochrome p450 fragments 
that correspond to several groups of p450 species based on 
their sequence identity. 

5 SUMMARY 

The present invention is directed to plant p450 enzymes. 
The present invention is further directed to plant p450 
enzymes from Nicotiana. The present invention is also 
directed to p450 enzymes in plants whose expression is induced 

10 by ethylene and/or plant senescence. The present invention is 

yet further directed to nucleic acid sequences in plants 
having enzymatic activities, for example, being categorized as 
oxygenase, demethylase and the like, or other and the use of 
those sequences to reduce or silence the expression or over- 

15 expression of these enzymes. The invention also relates to 

p450 enzymes found in plants containing higher nornicotine 
levels than plants exhibiting lower nornicotine levels. 

In one aspect, the invention is directed to nucleic acid 
20 sequences as set forth in SEQ. ID. Nos . 1, 3, 5, 7, 9, 11, 13, 

15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 
45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 
75, 77, 79, 81, 83, 85, 87, 89, 91, 95, 97, 99, 101, 103, 105, 
107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 
25 131, 133, 135, 137, 139, 143, 145, 147, 149, 151, 153, 155, 

157, 159, 161, 153, 155, 167, 169, 171, 173, 175, 177, 179, 
181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 
205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 
229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 
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253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 
277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 
301, 303, 305, 307, 309, 311, 313 and 315. 

5 In a second related aspect, those fragments containing 

greater than 75% identity in nucleic acid sequence were placed 
into groups dependent upon their identity in a region 
corresponding to the first nucleic acid following the 
cytochrome p450 motif GXRXCX(G/A) to the stop codon. The 
10 representative nucleic acid groups and respective species are 

shown in Table I . 



In a third aspect, the invention is directed to amino 





acid 


[ sequences 


as set forth in 


SEQ. 


ID. 


Nos . 


2, 4, 


6, 8, 


10, 


15 


12, 


14, 16, 18, 


20, 


22, 


24, 26, 28, 


30, 


32, 


34, 36, 38, 


40, 




42, 


44, 46, 48, 


50, 


52, 


54, 56, 58, 


60, 


62, 


64, 66, 68, 


70, 




72, 


74, 76, 78, 


80, 


82, 84, 86, 


88, 


90, 92, 96, 98, 


100, 


102, 




104, 


106, 


108, 


110, 


112, 


114, 


116, 


118, 


120, 


122, 


124, 


126, 




128, 


130, 


132, 


134, 


136, 


138, 


140, 


144, 


146, 


148, 


150, 


152, 


20 


154, 


156, 


158, 


160, 


162, 


164, 


165, 


168, 


170, 


172, 


174, 


176, 




178, 


180, 


182, 


184, 


186, 


188, 


190, 


192, 


194, 


196, 


198, 


200, 




202, 


204, 


206, 


208, 


210, 


212, 


214, 


216, 


218, 


220, 


222, 


224, 




226, 


228, 


230, 


232, 


234, 


236, 


238, 


240, 


242, 


244, 


246, 


248, 




250, 


252, 


254, 


256, 


258, 


260, 


262, 


264, 


266, 


268, 


270, 


272, 


25 


274, 


276, 


278, 


280, 


282, 


284, 


286, 


288, 


290, 


292, 


294, 


296, 




298, 


300, 


302, 


304, 


306, 


308, 310, 312, 


314 and 316. 





In a fourth related aspect, those fragments containing 
greater than 71% identity in amino acid sequence were placed 
30 into groups dependent upon their identity to each other in a 
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region corresponding to the first amino acid following the 

cytochrome p450 motif GXRXCX(G/A) to the stop codon. The 

representative amino acid groups and respective species are 
shown in Table II . 

In a fifth aspect, the invention is directed to amino 
acid sequences of full length genes as set forth in SEQ, ID. 
Nos. 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 
172, 174, 175, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 
220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 
244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 265, 
268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 
292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314 and 
316. 

In a sixth related aspect, those full length genes 
containing 85% or greater identity in amino acid sequence were 
placed into groups dependent upon the identity to each other. 
The representative amino acid groups and respective species 
are shown in Table III. 

In a seventh aspect, the invention is directed to amino 
acid sequences of the fragments set forth in SEQ . ID. Nos. 
299-357, 

In the eighth related aspect, those fragments containing 
90% or greater identity in amino acid sequence were placed 
into groups dependent upon their identity to each other in a 
region corresponding to the first cytochrome p450 domain. 
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UXXRXXZ, to the third cytochrome domain, GXRXO, where U is E 
or K, X is any amino acid and Z is T, S or M. The 

representative amino acid groups respective species shown in 
Table IV. 

5 

In a ninth related aspect, the reduction or elimination 
or over- expression of p45 0 enzymes in l^icotiana plants may be 
accomplished transiently using RNA viral systems. 

10 Resulting transformed or infected plants are assessed for 

phenotypic changes including, but not limited to, analysis of 
endogenous p45 0 RNA transcripts, p450 expressed peptides, and 
concentrations of plant metabolites using techniques commonly 
available to one having ordinary skill in the art- 

15 

In a tenth important aspect, the present invention is 
also directed to generation of trangenic Nicotiana lines that 
have altered p450 enzyme activity levels. In accordance with 
the invention, these transgenic lines include nucleic acid 

20 sequences that are effective for reducing or silencing or 

increasing the expression of certain enzyme thus resulting in 
phenotypic effects within Nicotiana. Such nucleic acid 
sequences include SEQ. ID. Nos. 1, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 

25 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 

79, 81, 83, 85, 87, 89, 91, 95, 97, 99, 101, 103, 105, 107, 
109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 
133, 135, 137, 139, 143, 145, 147, 149, 151, 153, 155, 157, 
159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 

30 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 
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207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 

231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 

255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 

279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 

5 303, 305, 307, 309, 311, 313 and 315. 

In a very important eleventh aspect of the invention, 
plant cultivars including nucleic acids of the present 
invention in a down regulation capacity using either full 
10 length genes or fragments thereof or in an over-expression 

capacity using full length genes will have altered metabolite 
profiles relative to control plants. 

In a twelfth aspect of the invention, plant cultivars 

15 including nucleic acid of the present invention using either 

full length genes or fragments thereof in modifying the 
biosynthesis or breakdown of metabolites derived from the 
plant or external to the plants, will have use in tolerating 
certain exogenous chemicals or plant pests. Such nucleic acid 

20 sequences include SEQ ID. Nos. 1, 3, 5, 7, 9, 11, 13, 15, 17, 

19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 
49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 
79, 81, 83, 85, 87, 89, 91, 95, 97, 99, 101, 103, 105, 107, 
109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 

25 133, 135, 137, 139, 143, 145, 147, 149, 151, 153, 155, 157, 

159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 
183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 
207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 
231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 

30 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 

8 
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279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 
303, 305, 307, 309, 311, 313 and 315, 

In a thirteenth aspect, the present invention is directed 
5 to the screening of plants, more preferably Nicotiana, that 

contain genes that have substantial nucleic acid identity to 
the taught nucleic acid sequence. The use of the invention 
would be advantageous to identify and select plants that 
contain a nucleic acid sequence with exact or substantial 
10 identity where such plants are part of a breeding program for 

traditional or transgenic varieties, a mutagenesis program, or 
naturally occurring diverse plant populations. The screening 
of plants for substantial nucleic acid identity may be 
accomplished by evaluating plant nucleic acid materials using 
15 a nucleic acid probe in conjunction with nucleic acid 

detection protocols including, but not limited to, nucleic 
acid hybridization and PGR analysis- The nucleic acid probe 
may consist of the taught nucleic acid sequence or fragment 
thereof corresponding to SEQ ID 1, 3, 5, 7, 9, 11, 13, 15, 17, 



20 



25 



19, 


21, 23, 25, 


27, 


29, 


31, 33, 35, 


37, 


39, 


41, 43, 45, 


47, 


49, 


51, 53, 55, 


57, 


59, 


61, 63, 65, 


67, 


69, 


71, 73 


, 75, 


77, 


79, 


81, 83, 85, 


87, 


89, 


91, 95, 97, 


99, 


101, 


103, 


105, 


107, 


109, 


111, 


113, 


115, 


117, 


119, 


121, 


123, 


125, 


127, 


129, 


131, 


133, 


135, 


137, 


139, 


143, 


145, 


147, 


149, 


151, 


153, 


155, 


157, 


159, 


161, 


163, 


165, 


167, 


169, 


171, 


173, 


175, 


177, 


179, 


181, 


183, 


185, 


187, 


189, 


191, 


193, 


195, 


197, 


199, 


201, 


203 , 


205, 


207, 


209, 


211, 


213, 


215, 


217, 


219, 


221, 


223, 


225, 


227, 


229, 


231, 


233, 


235, 


237, 


239, 


241, 


243, 


245, 


247, 


249, 


251, 


253 , 


255, 


257, 


259, 


261, 


263, 


265, 


267, 


269, 


271, 


273, 


275, 


277, 
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279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 
303, 305, 307, 309, 311, 313 and 315. 

In a fourteenth aspect, the present invention is directed 
to the identification of plant genes, more preferably 
Nicotiana, that share substantial amino acid identity 
corresponding to the taught nucleic acid sequence. The 
identification of plant genes including both cDNA and genomic 
clones, those cDNAs and genomic clones, more preferably from 
Nicotiana may be accomplished by screening plant cDNA 
libraries using a nucleic acid probe in conjunction with 
nucleic acid detection protocols including, but not limited 
to, nucleic acid hybridization and PGR analysis. The nucleic 
acid probe rriay be comprised of nucleic acid sequence or 
fragment thereof corresponding to SEQ ID 1, 3, 5, 7, 9, 11, 
13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 
43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 
73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 95, 97, 99, 101, 103, 
105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 
129, 131, 133, 135, 137, 139, 143, 145 and 147. 

In an alterative fifteenth aspect, cDNA expression 
libraries that express peptides may be screened using 
antibodies directed to part or all of the taught amino acid 
sequence. Such amino acid sequences include SEQ ID 2 , 4, 8, 
9, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 
70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 96, 98, 100, 
102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 
126, 128, 130, 132, 134, 136, 138, 140, 144, 146, 148. 
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In a sixteenth important aspect, the present invention is 
also directed to generation of transgenic Nicotiana lines that 
have over-expression of p450 enzyme activity levels. In 
accordance with the invention, -these transgenic lines include 
all nucleic acid sequences encoding the amino acid sequences 
of full length genes that are effective for increasing the 
expression of certain enzyme thus resulting in phenotypic 
effects within Nicotiana. Such amino acid sequences include 



SEQ. 


ID. 


150, 


152, 


154, 


156, 


158, 


160, 


162, 


164, 


166, 


168, 


170, 


172, 


174, 


176, 


178, 


180, 


182, 


184, 


186, 


188, 


190, 


192, 


194, 


196, 


198, 


200, 


202, 


204, 


206, 


208, 


210, 


212, 


214, 


216, 


218, 


220, 


222, 


224, 


226, 


228, 


230, 


232, 


234, 


236, 


238, 


240, 


242, 


244, 


246, 


248, 


250, 


252, 


254, 


256, 


258, 


260, 


262, 


264, 


266, 


268, 


270, 


272, 


274, 


276, 


278, 


280, 


282, 


284, 


286, 


288, 


290, 


292, 


294, 


296, 


298. 


300, 


302, 


304, 


306, 


308, 


310, 


312, 



314 and 316. 

A tobacco product is also provided that includes tobacco 
leaf (lamina and/or stem) having reduced amounts of 
nornicotine. The tobacco product includes tobacco (tobacco 
leaf including lamina and/ or stem) from a plant that includes 
the sequences described herein or where genes encoding tobacco 
specific nitrosamines have been eliminated or suppressed. The 
elimination or suppression of genes encoding tobacco speicific 
nitrosamines is effective for reducing tobacco specific 
nitrosamines in the tobacco products from about 5 to about 
10%, in another aspect from about 10 to 20%, in another aspect 
about 2 0 to 30%, and in another aspect greate than 3 0%, as 
compared to tobacco products made from tobacco plants where 
gnees coding for tobacco specific nitrosamines have not been 
eliminated or suppressed. As used herein, the tobacco product 
may include cigarettes, cigars, pipe tobacco, snuff chewing 
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tobacco, products blended with the tobacco product, and 
mixtures thereof . 

BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 shows nucleic acid SEQ. ID. No. :1 and amino acid 
SEQ . ID . No . : 2 . 

Figure 2 shows nucleic acid SEQ. ID. No. :3 and amino acid 
SEQ, ID. No. :4. 

Figure 3 shows nucleic acid SEQ. ID. No. :5 and amino acid 
SEQ. ID. No. :6. 

Figure 4 shows nucleic acid SEQ. ID. No. :7 and amino acid 
SEQ . ID . No . : 8 . 

Figure 5 shows nucleic acid SEQ. ID. No. :9 and amino acid 
SEQ. ID. No. :10. 

Figure 6 shows nucleic acid SEQ. ID. No.: 11 and amino 
acid SEQ. ID. No.: 12. 

Figure 7 shows nucleic acid SEQ. ID. No.: 13 and amino 
acid SEQ . ID . No . : 14 . 

Figure 8 shows nucleic acid SEQ. ID. No.: 15 and amino 
acid SEQ. ID. No. :16. 

Figure 9 shows nucleic acid SEQ. ID. No.: 17 and amino 
acid SEQ. ID. No.: 18. 

Figure 10 shows nucleic acid SEQ. ID. No.:19 and amino 
acid SEQ. ID. No.: 20. 

Figure 11 shows nucleic acid SEQ. ID. No.:21 and amino 
acid SEQ. ID. No.:22. 

Figure 12 shows nucleic acid SEQ. ID. No.: 23 and amino 
acid SEQ. ID. No.: 24. 

Figure 13 shows nucleic acid SEQ. ID. No.: 25 and amino 
acid SEQ. ID. No.: 26. 
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Figure 14 shows nucleic acid SEQ- ID. No. ;27 and amino 
acid SEQ. ID. No.: 28. 

Figure 15 shows nucleic acid SEQ. ID. No.:29 and amino 
acid SEQ. ID. No.^-30. 
5 Figure 16 shows nucleic acid SEQ. ID. No,: 31 and amino 

acid SEQ. ID. No.: 32. 

Figure 17 shows nucleic acid SEQ. ID. No.:33 and amino 
acid SEQ. ID. No.:34. 

Figure 18 shows nucleic acid SEQ. ID. No.:35 and amino 
10 acid SEQ. ID, No.: 36. 

Figure 19 shows nucleic acid SEQ. ID. No.: 37 and amino 
acid SEQ. ID. No.: 38. 

Figure 2 0 shows nucleic acid SEQ, ID, No. :39 and amino 
acid SEQ. ID. No.: 40. 
15 Figure 21 shows nucleic acid SEQ. ID. No.: 41 and amino 

acid SEQ. ID. No.: 42. 

Figure 22 shows nucleic acid SEQ. ID. No.: 43 and amino 
acid SEQ. ID. No.:44. 

Figure 2 3 shows nucleic acid SEQ. ID. No.: 45 and amino 
20 acid SEQ. ID. No.: 46. 

Figure 24 shows nucleic acid SEQ. ID. No.: 47 and amino 
acid SEQ. ID. No. :48. 

Figure 25 shows nucleic acid SEQ. ID. No.:49 and amino 
acid SEQ. ID. No. :50. 
25 Figure 26 shows nucleic acid SEQ. ID. No.: 51 and amino 

acid SEQ. ID. No. :52. 

Figure 27 shows nucleic acid SEQ, ID. No.: 53 and amino 
acid SEQ. ID. No.: 54. 

Figure 28 shows nucleic acid SEQ. ID. No.: 55 and amino 
30 acid SEQ. ID. No.: 56. 

Figure 29 shows nucleic acid SEQ. ID. No.: 57 and amino 
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acid SEQ. ID. No.:58. 

Figure 3 0 shows 
acid SEQ. ID. No.: 60. 

Figure -31 shows 
acid SEQ. ID. No.: 62. 

Figure 3 2 shows 
acid SEQ. ID. No . : 64 . 

F i gur e 33 shows 
acid SEQ. ID. No.:66. 

Figure 34 shows 
acid SEQ. ID. No.: 68. 

Figure 35 shows 
acid SEQ. ID. No. :70. 

F igure 3 6 shows 
acid SEQ. ID. No.: 72. 

Figure 37 shows 
acid SEQ. ID. No. :74. 

Figure 38 shows 
acid SEQ. ID. No.:76. 

Figure 39 shows 
acid SEQ. ID. No. :78. 

Figure 40 shows 
acid SEQ. ID. No. :80. 

Figure 41 shows 
acid SEQ. ID. No.: 82. 

Figure 42 shows 
acid SEQ. ID. No. :84. 

Figure 43 shows 
acid SEQ. ID. No. :86. 

Figure 44 shows 
acid SEQ. ID. No. :88. 



nucleic acid SEQ. ID. 
nucleic acid SEQ. ID. 
nucleic acid SEQ. ID. 
nucleic acid SEQ. ID. 
nucleic acid SEQ. ID. 
nucleic acid SEQ. ID. 
nucleic acid SEQ. ID. 
nucleic acid SEQ. ID. 
nucleic acid SEQ, ID. 
nucleic acid SEQ. ID. 
nucleic acid SEQ. ID. 
nucleic acid SEQ. ID. 
nucleic acid SEQ. ID. 
nucleic acid SEQ. ID. 
nucleic acid SEQ. ID. 
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No . : 5 9 and amino 
No . : 61 and amino 
No,: 63 and amino 
No.: 65 and amino 
No . : 67 and amino 
No.: 69 and amino 
No. :71 and amino 
No.: 73 and amino 
No . : 7 5 and amino 
No.: 77 and amino 
No.: 79 and amino 
No . : 8 1 and amino 
No.: 83 and amino 
No.: 85 and amino 
No.: 87 and amino 
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Figure 45 shows nucleic 
acid SEQ. ID. No. : 90. 

Figure 46 shows nucleic 
acid SEQ. ID. No.: 92. 
5 Figure 48 shows nucleic 

acid SEQ. ID. No. :96. 

Figure 49 shows nucleic 
acid SEQ. ID. No.:98. 

Figure 50 shows nucleic 
10 acid SEQ. ID. No.: 100. 

Figure 51 shows nucleic 
acid SEQ. ID. No.: 102. 

'Figure 52 shows nucleic 
acid SEQ. ID. No.: 104. 
15 Figure 53 shows nucleic 

acid SEQ. ID. No.:106. 

Figure 54 shows nucleic 
acid SEQ. ID. No.: 108. 

Figure 55 shows nucleic 
20 acid SEQ. ID. No.:110. 

Figure 56 shows nucleic 
acid SEQ. ID. No. : 112 . 

Figure 57 shows nucleic 
acid SEQ. ID. No.: 114. 
25 Figure 58 shows nucleic 

acid SEQ. ID. No.:116. 

Figure 59 shows nucleic 
acid SEQ, ID. No.: 118. 

Figure 50 shows nucleic 
30 acid SEQ. ID. No.: 120. 

Figure 51 shows nucleic 
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acid SEQ. ID. No.: 89 and amino 
acid SEQ. ID. No.: 91 and amino 
acid SEQ. ID. No.: 95 and amino 
acid SEQ. ID. No.: 97 and amino 
acid SEQ. ID. No.:99 and amino 
acid SEQ. ID. No.:101 and amino 
acid SEQ. ID. No.: 103 and amino 
acid SEQ. ID. No.: 105 and amino 
acid SEQ. ID. No.: 107 and amino 
acid SEQ, ID. No.: 109 and amino 
acid SEQ. ID. No.: Ill and amino 
acid SEQ. ID. No.: 113 and amino 
a.cid SEQ, ID. No.: 115 and amino 
acid SEQ. ID. No.: 117 and amino 
acid SEQ. ID. No,: 119 and amino 
acid SEQ. ID., No.: 121 and amino 
15 
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acid SEQ. ID. No.: 122. 

Figure 62 shows nucleic 
acid SEQ. ID. No.: 124. 

Figure 63 shows nucleic 
acid SEQ. ID. No.:126. 

Figure 64 shows nucleic 
acid SEQ. ID. No.:128. 

Figure 65 shows nucl.eic 
acid SEQ. ID. No. :130. 

Figure 66 shows nucleic 
acid SEQ. ID. No.: 132. 

Figure 67 shows nucleic 
acid SEQ. ID. No. :134. 

Figure 68 shows nucleic 
acid SEQ. ID. No.: 13 6. 

Figure 69 shows nucleic 
acid SEQ. ID. No.: 138. 

Figure 70 shows nucleic 
acid SEQ. ID. No.: 140. 

Figure 72 shows nucleic 
acid SEQ. ID. No.:144. 

Figure 73 shows nucleic 
acid SEQ. ID. No.:146. 

Figure 74 shows nucleic 
acid SEQ. ID. No.: 148. 

Figure 75 shows nucleic 
acid SEQ. ID. No.: 150. 

Figure 7 6 shows nucleic 
acid SEQ. ID. No.: 152. 

Figure 77 shows nucleic 
acid SEQ. ID. No.: 154. 
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acid SEQ. ID. No.: 12 3 and amino 
acid SEQ. ID. No.;125 and amino 
acid SEQ. ID. No.:127 and amino 
acid SEQ- ID. No.: 129 and amino 
acid SEQ- ID. No.:131 and amino 
acid SEQ. ID. No.: 133 and amino 
acid SEQ. ID. No.:135 and amino 
acid SEQ. ID. No,: 137 and amino 
acid SEQ. ID. No.: 139 and amino 
acid SEQ. ID. No,: 143 and amino 
acid SEQ. ID. No.: 145 and amino 
acid SEQ. ID. No.:147 and amino 
acid SEQ. ID No,: 149 and amino 
acid SEQ. ID No.: 151 and amino 
acid SEQ. ID No.: 153 and amino 
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Figure 78 shows nucleic acid SEQ. 
acid SEQ. ID. No.: 156. 

Figure 79 shows nucleic acid SEQ. 
acid SEQ. ID. No.: 158. 

Figure 80 shows nucleic acid SEQ. 
acid SEQ. ID. No.: 160. 

Figure 81 shows nucleic acid SEQ. 
acid SEQ. ID. No-: 162. 

Figure 82 shows nucleic acid SEQ. 
acid SEQ. ID- No.: 164. 

Figure 83 shows nucleic acid SEQ, 
acid SEQ. ID. No.: 156. 

Figure 84 shows nucleic acid SEQ. 
acid SEQ. ID. No.: 168. 

Figure 85 shows nucleic acid SEQ. 
acid SEQ. ID. No.: 17 0. 

Figure 86 shows nucleic acid SEQ. 
acid SEQ. ID. No.: 172. 

Figure 87 shows nucleic acid SEQ. 
acid SEQ. ID. No. : 174. 

Figure 88 shows nucleic acid SEQ. 
acid SEQ. ID. No.: 176. 

Figure 89 shows nucleic acid SEQ. 
acid SEQ. ID. No.: 178, 

Figure 90 shows nucleic acid SEQ. 
acid SEQ. ID. No . : 180. 

Figure 91 shows nucleic acid SEQ. 
acid SEQ. ID. No.: 182. 

Figure 92 shows nucleic acid SEQ. 
acid SEQ. ID. No. : 184. 

Figure 93 shows nucleic acid SEQ. 
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ID No.: 155 and amino 
ID No. : 157 and amino 
ID No.: 159 and amino 
ID No.: 161 and amino 
ID No. : 163 and amino 
ID No. : 165 and amino 
ID No. : 167 and amino 
ID No.: 169 and amino 
ID No.: 171 and amino 
ID No.: 17 3 and amino 
ID No.: 17 5 and amino 
ID No . : 177 and amino 
ID No.: 179 and amino 
ID No. : 181 and amino 
ID No. : 183 and amino 
ID No.: 185 and amino 
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acid SEQ. ID. No.: 186. 

Figure 94 shows nucleic acid SEQ. ID No.: 187 and amino 
acid SEQ. ID. No.: 188. 

Figure 95 shows nucleic acid SEQ. ID No. : 189 and amino 
5 acid SEQ. ID. No.: 190. 

Figure 96 shows nucleic acid SEQ. ID No.: 191 and amino 
acid SEQ. ID. No.: 192. 

Figure 97 shows nucleic acid SEQ. ID No.: 193 and amino 
acid SEQ. ID. No.: 194. 
10 Figure 98 shows nucleic acid SEQ. ID No.: 195 and amino 

acid SEQ. ID. No.; 196. 

Figure 99 shows nucleic acid SEQ. ID, No.; 197 and amino 
acid SEQ. ID. No.: 198. 

Figure 100 shows nucleic acid SEQ. ID No.: 199 and amino 
15 acid SEQ. ID. No.: 200. 

Figure 101 shows nucleic acid SEQ. ID No.: 201 and amino 
acid SEQ. ID. No.: 2 02. 

Figure 102 shows nucleic acid SEQ. ID No.: 203 and amino 
acid SEQ. ID, No.; 2 04. 
20 Figure 103 shows nucleic acid SEQ. ID No. : 205 and amino 

acid SEQ. ID. No.: 206. 

Figure 104 shows nucleic acid SEQ. ID No. : 207 and amino 
acid SEQ. ID. No.: 208- 

Figure 105 shows nucleic acid SEQ. ID No.: 209 and amino 
25 acid SEQ. ID. No.: 210. 

Figure 106 shows nucleic acid SEQ. ID No.: 211 and amino 
acid SEQ. ID. No.: 212, 

Figure 107 shows nucleic acid SEQ. ID No.: 213 and amino 
acid SEQ. ID. No.: 214. 
30 Figure 108 shows nucleic acid SEQ. ID No. : 215 and amino 

acid SEQ. ID. No.: 216. 
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Figure 109 shows nucleic 
acid SEQ. ID. No.: 218. 

Figure 110 shows nucleic 
acid SEQ. ID. No.: 220. 
5 Figure 111 shows nucleic 

acid SEQ. ID. No.: 222. 

Figure 112 shows nucleic 
acid SEQ. ID. No.: 224. 

Figure 113 shows nucleic 
10 acid SEQ. ID. No.: 226. 

Figure 114 shows nucleic 
acid SEQ. ID. No.: 228. 

Figure 115 shows nucleic 
acid SEQ. ID. No.: 230, 
15 Figure 116 shows nucleic 

acid SEQ. ID. No.: 232. 

Figure 117 shows nucleic 
acid SEQ. ID. No.: 234, 

Figure 118 shows nucleic 
20 acid SEQ. ID. No.: 23 6. 

Figure 119 shows nucleic 
acid SEQ. ID. No. : 238. 

Figure 12 0 shows nucleic 
acid SEQ. ID. No.: 240. 
25 Figure 121 shows nucleic 

acid SEQ. ID. No.: 242. 

Figure 122 shows nucleic 
acid SEQ. ID. No.: 244. 

Figure 123 shows nucleic 
30 acid SEQ. ID. No.: 246. 

Figure 124 shows nucleic 



acid SEQ. ID No. : 217 and amino 
acid SEQ. ID No.: 219 and amino 
acid SEQ. ID No. : 221 and amino 
acid SEQ. ID No . : 223 and amino 
acid SEQ. ID No. : 225 and amino 
acid SEQ. ID No . : 227 and amino 
acid SEQ. ID No. : 229 and amino 
acid SEQ. ID No. : 231 and amino 
acid SEQ. ID No. : 233 and amino 
acid SEQ. ID No. : 23 5 and amino 
acid SEQ. ID No. : 237 and amino 
acid SEQ. ID No. : 239 and amino 
acid SEQ. ID No.: 241 and amino 
acid SEQ. ID No. : 243 and amino 
acid SEQ. ID No. : 245 and amino 
acid SEQ. ID No. : 247 and amino 
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acid SEQ. ID. No.: 248. 

Figure 125 shows nucleic 
acid SEQ. ID. No.: 250. 

Figure 12 6 shows nucleic 
acid SEQ. ID. No,: 252- 

Figure 127 shows nucleic 
acid SEQ. ID. No.: 254. 

Figure 128 shows nucleic 
acid SEQ. ID. No.: 256. 

Figure 129 shows nucleic 
acid SEQ. ID. No.: 258. 

Figure 130 shows nucleic 
acid SEQ. ID- No.: 260. 

Figure 131 shows nucleic 
acid SEQ. ID. No.: 262. 

Figure 132 shows nucleic 
acid SEQ. ID. No. : 264. 

Figure 133 shows nucleic 
acid SEQ. ID. No . : 266. 

Figure 134 shows nucleic 
acid SEQ. ID. No.: 268. 

Figure 135 shows nucleic 
acid SEQ. ID. No.: 270. 

Figure 13 6 shows nucleic 
acid SEQ. ID. No, : 272, 

Figure 137 shows nucleic 
acid SEQ, ID. No.: 274. 

Figure 138 shows nucleic 
acid SEQ. ID. No.: 276. 

Figure 139 shows nucleic 
acid SEQ. ID. No.: 278. 



acid SEQ. ID No . : 249 and amino 
acid SEQ. ID No. : 251 and amino 
acid SEQ. ID No. : 253 and amino 
acid SEQ. ID No. : 255 and amino 
acid SEQ- ID No . : 257 and amino 
acid SEQ, ID No. : 259 and amino 
acid SEQ. ID No, : 261 and amino 
acid SEQ. ID No . : 263 and amino 
acid SEQ. ID No. : 2 65 and amino 
acid SEQ. ID No, : 267 and amino 
acid SEQ. ID No. : 2 69 and amino 
acid SEQ. ID No. : 271 and amino 
acid SEQ. ID No. : 273 and amino 
acid SEQ. ID No. : 275 and amino 
acid SEQ. ID No. : 277 and amino 
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Figure 140 shows nucleic acid SEQ, ID No.: 279 and amino 
acid SEQ. ID. No.: 280. 

Figure 141 shows nucleic acid SEQ. ID No. : 281 and amino 
acid SEQ. ID. No.: 282. 
5 Figure 142 shows nucleic acid SEQ. ID No.: 283 and amino 

acid SEQ. ID. No.: 284. 

Figure 143 shows nucleic acid SEQ. ID No.: 285 and amino 
acid SEQ. ID. No.: 286. 

Figure 144 shows nucleic acid SEQ. ID No. : 287 and amino 
10 acid SEQ. ID. No.: 288. 

Figure 145 shows nucleic acid SEQ. ID No. : 289 and amino 
acid SEQ. ID. No.: 290. 

Figure 146 shows nucleic acid SEQ. ID No. : 291 and amino 
acid SEQ. ID. No.; 292. 
15 Figure 147 shows nucleic acid SEQ. ID No.: 293 and amino 

acid SEQ. ID. No.: 294- 

Figure 148 shows nucleic acid SEQ, ID No. : 295 and amino 
acid SEQ. ID. No.: 296. 

Figure 149 shows nucleic acid SEQ. ID No.: 297 and amino 
20 acid SEQ. ID. No.; 298. Figure 151 

shows a comparison of Seqiience Groups. 

Figure 152 illustrates alignment of full length clones. 

Figure 153 shows a procedure used for cloning of 
cytochrome p450 cDNA fragments by PGR 
25 Figure 154 shows nucleic acid SEQ. ID No . : 299 and amino 

acid SEQ. ID. No.: 300. 

Figure 155 shows nucleic acid SEQ. ID No. : 301 and amino 
acid SEQ. ID. No.: 302. 

Figure 156 shows nucleic acid SEQ. ID No . : 303 and amino 
30 acid SEQ. ID. No.: 304. 

Figure 157 shows nucleic acid SEQ. ID No.: 305 and amino 
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acid SEQ. ID. No.: 306. 

Figure 158 shows nucleic acid SEQ. ID No.: 307 and amino 
acid SEQ. ID. No.: 308. 

Figure 159 shows nucleic acid SEQ. ID No. : 309 and amino 
acid SEQ. ID. No.: 310. 

Figure 160 shows nucleic acid SEQ. ID No.: 311 and amino 
acid SEQ. ID. No.: 312. 

Figure 161 shows nucleic acid SEQ. ID No. : 313 and amino 
acid SEQ. ID. No.: 314. 

Figure 162 shows nucleic acid SEQ. ID No. : 315 and amino 
acid SEQ. ID. No.: 316. 

Figure 163 shows probe set sequences of all clones on 
GeneChip . 

DETAILED DESCRIPTION 

DEFINITIONS 

Unless defined otherwise, all technical and scientific 
terms used herein have the same meaning as commonly understood 
by one of ordinary skill in the art to which this invention 
belongs. Singleton et al . (1994) Dictionary of Microbiology 
and Molecular Biology, second edition, John Wiley and Sons 
(New York) provides one of skill with a general dictionary of 
many of the terms used in this invention. All patents and 
publications referred to herein are incorporated by reference 
herein. For purposes of the present invention, the following 
terms are defined below. 

''Enzymatic activity" is meant to include demethylation, 
hydroxylation, epoxidation, N-oxidation, sulf ooxidation, N-, 
S-, and O- dealkylations, desulfation, deamination, and 
reduction of azo, nitro, and N-oxide groups. The term 
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"nucleic acid" refers to a deoxyribonucleotide or 
ribonucleotide polymer in either single-- or double- stranded 
form, or sense or anti-sense, and unless otherwise limited, 
encompasses known analogues of natural nucleotides that 

5 hybridize to nucleic acids in a manner similar to naturally 

occurring nucleotides. Unless otherwise indicated, a 
particular nucleic acid sequence includes the complementary 
sequence thereof. The terms "operably linked", ^'in operable 
combination", and ''in operable order" refer to functional 

10 linkage between a nucleic acid expression control sequence 

(such as a promoter, signal sequence, or array of 
transcription factor binding sites) and a second nucleic acid 
sequence, wherein the expression control sequence affects 
transcription and/or translation of the nucleic acid 

15 corresponding to the second sequence. 

The term "recombinant" when used with reference to a cell 
indicates that the cell replicates a heterologous nucleic 
acid, expresses said nucleic acid or expresses a peptide, 

20 heterologous peptide, or protein encoded by a heterologous 

nucleic acid. Recombinant cells can express genes or gene 
fragments in either the sense or anti sense form that are not 
found within the native (non- recombinant) form of the cell. 
Recombinant cells can also express genes that are found in the 

25 native form of the cell; but wherein the genes are modified 

and re- introduced into the cell by artificial means. 

A "structural gene" is that portion of a gene comprising 
a DNA segment encoding a protein, polypeptide or a portion 
30 thereof, and excluding the 5' sequence which drives the 

initiation of transcription. The structural gene may 
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alternatively encode a nontranslatable product. The structural 
gene may be one which is normally found in the cell or one 
which is not noormally found in the cell or cellular location 
wherein it is introduced, in which case it is termed a 
'^heterologous gene" . A heterologous gene may be derived in 
whole or in part from any source known to the art, including 
a bacterial genome or episome, eukaryotic, nuclear or plasmid 
DNA, cDNA, viral DNA or chemically synthesized DNA. A 
structural gene may contain one or more modifications that 
could effect biological activity or its characteristics, the 
biological activity or the chemical structure of the 
expression product, the rate of expression or the manner of 
expression control. Such modifications include, but are not 
limited to, mutations, insertions, deletions and substitutions 
of one or more nucleotides. The structural gene may 
constitute an uninterrupted coding sequence or it may include 
one or more introns, bounded by the appropriate splice 
junctions. The structural gene may be translatable or non- 
translatable, including in an anti-sense orientation. The 
structural gene may be a composite of segments derived from a 
plurality of sources and from a plurality of gene sequences 
(naturally occurring or synthetic, where synthetic refers to 
DNA that is chemically synthesized) . 

''Derived from" is used to mean taken, obtained, received, 
traced, replicated or descended from a source (chemical and/or 
biological) . A derivative may be produced by chemical or 
biological manipulation (including, but not limited to, 
substitution, addition, insertion, deletion, extraction, 
isolation, mutation and replication) of the original source. 
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''Chemically synthesized'', as related to a sequence of 
DNA, means that portions of the component nucleotides were 
assembled in vitro. Manual chemical synthesis of DNA may be 
accomplished using well established procedures (Caruthers, 
Methodology of DNA and RNA Sequencing , (1983), Weissman (ed.), 
Praeger Publishers, New York, Chapter 1); automated chemical 
synthesis can be performed using one of a number of 
commercially available machines- 

Optimal alignment of sequences for comparison may be 
conducted by the local homology algorithm of Smith and 
Waterman, Adv. Appl, Math. 2:482 (1981), by the homology 
alignment algorithm of Needleman and Wunsch, J. Mol. Biol. 
48:443 (1970), by the search for similarity method of Pearson • 
and Lipman Proc. Natl. Acad, Sci. (U.S.A.) 85: 2444 (1988), by 
computerized implementations of these algorithms (GAP, 
BESTFXT, FASTA, and TFASTA in the Wisconsin Genetics Software 
Package, Genetics Computer Group, 575 Science Dr., Madison, 
Wis.), or by inspection. 

The NCBI Basic Local Alignment Search Tool (BLAST) 
(Altschul et al., 1990) is available from several sources, 
including the National Center for Biological Information 
(NCBI, Bethesda, Md.) and on the Internet, for use in 
connection with the sequence analysis programs blastp, blastn, 
blastx, tblastn and tblastx. It can be accessed at 
htp: //www. ncbi .nlm.nih.gov/BLAST/ . A description of how to 
determine sequence identity using this program is available at 
http: / /www. ncbi .nlm.nih.gov/BLAST/blast help.html. 
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The terms "substantial amino acid identity" or 
"substantial amino acid sequence identity" as applied to amino 
acid sequences and as used herein denote a characteristic of 
a polypeptide, wherein the peptide comprises a sequence that 
has at least 7 0 percent sequence identity, preferably 80 
percent amino acid sequence identity, more preferably 90 
percent amino acid sequence identity, and most preferably at 
least 99 to 100 percent sequence identity as compared to a 
reference group over region corresponding to the first amino 
acid following the cytochrome p450 motif GXRXCX(G/A) to the 
stop codon of the translated peptide. 

The terms "substantial nucleic acid identity" or 
"substantial nucleic acid sequence identity" as applied to 
nucleic acid sequences and as used herein denote a 
characteristic of a polynucleotide sequence, wherein the 
polynucleotide comprises a sequence that has at least 75 
percent sequence identity, preferably 81 percent amino acid 
sequence identity, more preferably at least 91 percent 
sequence identity, and most preferably at least 99 to 100 
percent sequence identity as compared to a reference group 
over region corresponding to the first nucleic acid following 
the cytochrome p450 motif GXRXCX(G/A) to the stop codon of the 
translated peptide. 

Another indication that nucleotide sequences are 
substantially identical is if two molecules hybridize to each 
other under stringent conditions. Stringent conditions are 
sequence-dependent and will be different in different 
circumstances. Generally, stringent conditions are selected to 
be about 5*C to about 20 'C, usually about 10 'C to about 15 'C, 
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lower than the thermal melting point (Tm) for the specific 
sequence at a defined ionic strength and pH. The Tm is the 
temperature (under defined ionic strength and pH) at which 50% 
of the target sequence hybridijses to a matched probe. 
Typically, stringent conditions will be those in which the 
salt concentration is about 0.02 molar at pH 7 and the 
temperature is at least about 60 'C. For instance in a 
standard Southern hybridization procedure, stringent 
conditions will include an initial wash in 6xSSC at 42 *C 
followed by one or more additional washes in 0-2xSSC at a 
temperature of at least about 55 'C, typically about 60 'C and 
often about 65 'C. 

Nucleotide sequences are also substantially identical for 
purposes of this invention when the polypeptides and/or 
proteins which they encode are substantially identical. Thus, 
where one nucleic acid sequence encodes essentially the same 
polypeptide as a second nucleic acid sequence, the two nucleic 
acid sequences are substantially identical, even if they would 
not hybridize under stringent conditions due to degeneracy 
permitted by the genetic code (see, Darnell et al.i (1990) 
Molecular Cell Biology, Second Edition Scientific American 
Books W. H. Freeman and Company New York for an explanation of 
codon degeneracy and the genetic code) . Protein purity or 

homogeneity can be indicated by a niimber of means well knovm 
in the art, such as polyacrylamide gel electrophoresis of a 
protein sample, followed by visualization upon staining. For 
certain purposes high resolution may be needed and HPLC or a 
similar means for purification may be utilized. 

As used herein, the term "vector" is used in reference to 
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nucleic acid molecules that transfer DNA segment (s) into a 
cell. A vector may act to replicate DNA and may reproduce 
independently in a host cell. The term "vehicle" is sometimes 
used interchangeably with ^ "vector . " The term "expression 
vector" as used herein refers to a recombinant DNA molecule 
containing a desired coding sequence and appropriate nucleic 
acid sequences necessary for the expression of the operably 
linked coding sequence in a particular host organism. Nucleic 
acid sequences necessary for expression in prokaryotes usually 
include a promoter, an operator (optional) , and a ribosome 
binding site, often along with other sequences, Eucaryotic 
cells are known to utilize promoters, enhancers, and 
termination and polyadenylation signals. 

For the purpose of regenerating complete genetically 
engineered plants with roots, a nucleic acid may be inserted 
into plant cells, for example, by any technique such as in 
vivo inoculation or by any of the known in vitro tissue 
culture techniques to produce transformed plant cells that can 
be regenerated into complete plants. Thus, for example, the 
insertion into plant cells may be by in vitro inoculation by 
pathogenic or non-pathogenic A. tumefaciens . Other such tissue 
culture techniques may also be employed. 

'"Plant tissue" includes differentiated and 
undifferentiated tissues of plants, including, but not limited 
to, roots, shoots, leaves, pollen, seeds, tumor tissue and 
various forms of cells in culture, such as single cells, 
protoplasts, embryos and callus tissue. The plant tissue may 
be in plants or in organ, tissue or cell culture. 
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^'Plant cell" as used herein includes plant cells in 
planta and plant cells and protoplasts in culture. *\iCMA'' 
or "complementary DNA" generally refers to a single stranded 
DNA molecule with a nticleotide sequence that is complementary 
5 to an RJWA molecule. cDNA is formed by the action of the 

enzyme reverse transcriptase on an RNA template. 

STRATEGIES FOR OBTAINING ISfUCLEIC ACID SEQUENCES 

10 In accordance with the present invention, RNA was 

extracted from Nicotiana tissue of converter and non-converter 
Nicotiana lines. The extracted RNA was then used to create 
cDNA. Nucleic acid sequences of the present invention were 
then generated using two strategies . 

15 

In the first strategy, the poly A enriched RNA was 
extracted from plant tissue and cDNA was made by reverse 
transcription PCR. The single strand cDNA was then used to 
create p450 specific PCR populations using degenerate primers 

20 plus a oligo d(T) reverse primer. The primer design was based 

on the highly conserved motifs of p450. Examples of specific 
degenerate primers are set forth in Figure 1 . Sequence 
fragments from plasmids containing appropriate size inserts 
were further analyzed. These size inserts typically ranged 

25 from about 300 to about 800 nucleotides depending on which 

primers were used. 

In a second strategy, a cDNA library was initially 
constructed. The cDNA in the plasmids was used to create p450 
30 specific PCR populations using degenerate primers plus T7 

primer on plasmid as reverse primer. As in the first 

29 
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Strategy, sequence fragments from plasmids containing 
appropriate size inserts were further analyzed. 

Nicotiana- plant lines known to produce high levels of 
nornicotine (converter) and plant lines having undetectable 
levels of nornicotine may be used as starting materials. 

Leaves can then be removed from plants and treated with 
ethylene to activate p450 enzymatic activities defined herein. 
Total RNA is extracted using techniques known in the art. cDNA 
fragments can then be generated using PGR (RT-PCR) with the 
oligo d(T) primer as described in Figure 153. The cDNA library 
can then be constructed more fully described in examples 
herein . 

The conserved region of p450 type enzymes can be used as 
a template for degenerate primers (Figure 75) . Using 
degenerate primers, p450 specific bands can be amplified by 
PGR. Bands indicative for p450 like enzymes can be identified 
by DNA sequencing, PGR fragments can be characterized using 
BLAST search, alignment or other tools to identify appropriate 
candidates - 

Sequence information from identified fragments can be 
used to develop PGR primers. These primers in combination of 
plasmid primers in cDNA library were used to clone full length 
p450 genes. Large-scale Southern reverse analysis was 
conducted to examine the differential expression for all 
fragment clones obtained and in some cases full length clones. 
In this aspect of the invention, these large-scale reverse 
Southern assays can be conducted using labeled total cDNA's 
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from different tissues as a probe to hybridize with cloned DNA 
fragments in order to screen all cloned inserts. 

Nonradioactive and radioactive (P^^) Northern blotting 
assays were also used to characterize clones p450 fragments 
and full length clones. 

Peptide specific antibodies were made against several 
full-length clones by deriving their amino acid sequence and 
selecting peptide regions that were antigenic and unique 
relative to other clones. Rabbit antibodies were made to 
synthetic peptides conjugated to a carrier protein. Western 
blotting analyses or other immunological methods were 
performed on plant tissue using these antibodies. 

Nucleic acid sequences identified as described above can 
be examined by using virus induced gene silencing technology 
(VIGS, Baulcombe, Current Opinions in Plant Biology, 1999, 
2:109-113) - 

Peptide specific antibodies were made for several full- 
length clones by deriving their amino acid sequence and 
selecting peptide regions that were potentially antigenic and 
were unique relative to other clones. Rabbit antibodies were 
made to synthetic petides conjugated to a carrier protein. 
Western blotting analyses were perfomed using these 
antibodies . 

In another aspect of the invention, interfering RNA 
technology (RMAi) is used to further characterize cytochrome 
p450 enzymatic activities in Nicotiana plants of the present 



31 



wo 2005/038033 



PCT/US2004/034065 



invention. The following references which describe this 
technology are incorporated by reference herein. Smith et al . , 
Nature, 2000, 407:319-320; Fire et al . , Nature, 1998, 391:306- 
311; Waterhouse et al . , PNAS, 1998, 95:13959-13964; Stalberg 
etal.. Plant Molecular Biology, 1993, 23:671- 683; Baulconibe, 
Current Opinions in Plant Biology, 1999, 2:109-113; and 
Brigneti et al. , EMBO Journal, 1998, 17 (22) : 6739-6746 . Plants 
may be transformed using RNAi techniques, ant i sense 
techniques, or a variety of other methods described. 

Several techniques exist for introducing foreign genetic 
material into plant cells, and for obtaining plants that 
stably maintain and express the introduced gene. Such 
techniques include acceleration of genetic material coated 
onto microparticles directly into cells (US Patents 4,945,050 
to Cornell and 5,141,131 to DowElanco) . Plants may be 
transformed using Agrobacterixmi technology, see US Patent 
5,177,010 to University of Toledo, 5,104,310 to Texas A&M, 
European Patent Application 0131624B1, European Patent 
Applications 120516, 159418B1, European Patent Applications 
120516, 159418B1 and 176,112 to Schilperoot, US Patents 
5,149,645, 5,469,976, 5,464,7 63 and 4,940,838 and 4,693,976 to 
Schilperoot, European Patent Applications 116718, 290799, 
320500 all to MaxPlanck, European Patent Applications 504662 
and 627752 to Japan Nicotiana, European Patent Applications 
0267159, and 0292435 and US Patent 5,231,019 all to Ciba 
Geigy, US Patents 5,463,174 and 4,762,785 both to Calgene, and 
US Patents 5,004,853 and 5,159,135 both to Agracetus, Other 
transformation technology includes whiskers technology, see 
U.S. Patents 5,302,523 and 5,464,765 both to Zeneca. 
Electroporation technology has also been used to trans foarm 
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plants, see WO 87/06614 to Boyce Thompson Institute, 5,472,869 
and 5,384,253 both to Dekalb, WO9209696 and W09321335 both to 
PGS. All of these transformation patents and publications are 
incorporated by reference. In addition to numerous 

technologies for transforming plants, the type of tissue which 
is contacted with the foreign genes may vary as well. Such 
tissue would include but would not be limited to embryogenic 
tissue, callus tissue type I and II, hypocotyl, meristem, and 
the like. Almost all plant tissues may be transformed during 
dedif ferentiation using appropriate techniques within the 
skill of an artisan. 

Foreign genetic material introduced into a plant may 
include a selectable marker. The preference for a particular 
marker is at the discretion of the artisan, but any of the 
following selectable markers may be used along with any other 
gene not listed herein which could function as a selectable 
marker. Such selectable markers include but are not limited 
to aminoglycoside phosphotransferase gene of transposon Tn5 
(Aph II) which encodes resistance to the antibiotics 
kanamycin, neomycin and G418, as well as those genes which 
code for resistance or tolerance to glyphosate; hygromycin,- 
methotrexate; phosphinothricin (bar) ; imidazolinones , 
sulfonylureas and triazolopyrimidine herbicides, such as 
chlorosulfuron; bromoxynil, dalapon and the like. 

In addition to a selectable marker, it may be desirous to 
use a reporter gene. In some instances a reporter gene may be 
used without a selectable marker. Reporter genes are genes 
which are typically not present or expressed in the recipient 
organism or tissue. The reporter gene typically encodes for 
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a protein which provide for some phenotypic change or 
enzymatic property. Examples of such genes are provided in K. 
Weising et al. Ann. Rev. Genetics, 22, 421 (1988), which is 
incorporated herein by reference. Preferred reporter genes 
include without limitation glucuronidase (GUS) gene and GFP 
genes . 

Once introduced into the plant tissue, the expression of 
the structural gene may be assayed by any means known to the 
art, and expression may be measured as mRJSTA transcribed, 
protein synthesized, or the amount of gene silencing that 
occurs (see U.S. Patent No. 5,583,021 which is hereby 
incorporated by reference) , Techniques are known for the in 
vitro culture of plant tissue, and in a n^amber of cases, for 
regeneration into whole plants (EP Appln ISTo. 88810309.0). 
Procedures for transferring the introduced expression complex 
to commercially useful cultivars are known to those skilled in 
the art. 

Once plant cells expressing the desired level of p450 
enzyme are obtained, plant tissues and whole plants can be 
regenerated therefrom using methods and techniques well-known 
in the art. The regenerated plants are then reproduced by 
conventional means and the introduced genes can be transferred 
to other strains and cultivars by conventional plant breeding 
techniques , 

The following examples illustrate methods for carrying 
out the invention and should be understood to be illustrative 
of, but not limiting upon, the scope of the invention which is 
defined in the appended claims . 
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EXAMPLES 

EXAMPLE I; DEVELOPMENT OF PLANT TISSUE AMD ETHYLENE TREATMENT 
Plant Growth 

Plants were seeded in pots and grown in a greenhouse for 
4 weeks. The 4 week old seedlings were transplanted into 
individual pots and grown in the greenhouse for 2 months. The 
plants were watered 2 times a day with water containing 150ppm 
NPK fertilizer during growth. The expanded green leaves were 
detached from plants to do the ethylene treatment described 
below. 

Cell Line 78379 

Tobacco line 78379, which is a burley tobacco line 
released by the University of Kentucky was used as a source of 
plant material. One hundred plants were cultured as standard 
in the art of growing tobacco and transplanted and tagged with 
a distinctive number (1-100) . Fertilization and field 
management were conducted as recommended. 

Three quarters of the 10 0 plants converted between 20 and 
100% of the nicotine to nornicotine. One cjuarter of the 100 
plants converted less than 5% of the nicotine to nornicotine. 
Plant number 87 had the least conversion (2%) while plant 
number 21 had 100% conversion. Plants converting less than 3% 
were classified as non- converters . Self -pollinated seed of 
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plant number 87 and plant number 21, as well as crossed (21 x 
87 and 87 x 21) seeds were made to study genetic and 
phenotypic differences. Plants from selfed 21 were 

converters, and 99% of selfs from 87 were non- converters . The 
other 1% of the plants from 87 showed low conversion (5-15%) . 
Plants from reciprocal crosses were all converters. 

Cell Line 44Q7 

Nicotiana line 4407, which is a burley line was used as 
a source of plant material- Uniform and representative plants 
(100) were selected and tagged. Of the 100 plants 97 were 
non-converters and three were converters. Plant nximber 56 had 
the least amount of conversion (1.2%) and plant number 58 had 
the highest level of conversion (96%) . Self -poll ena ted seeds 
and crossed seeds were made with these two plants. 

Plants from selfed-58 segregated with 3:1 converter to 
non-converter ratio. Plants 58-33 and 58-25, were identified 
as homozygous converter and nonconverter plant lines, 
respectively. The stable conversion of 58-33 was confirmed by 
analysis of its progenies of next generation. 
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Cell Line PBLsm 

PBLBOl is a burley line developed by ProfiGen, Inc. and 
was used as a source of plant material. The converter plant 
was selected from foundation seeds of PBLBOl. 

Ethylene Treatment Procedures 

Green leaves were detached from 2-3 month greenhouse 
grown plants and sprayed with 0.3% ethylene solution (Prep 
brand Ethephon ( Rhone- Poulenc) ) . Each sprayed leaf was hung 
in a curing rack equipped with humidifier and covered with 
plastic. During the treatment, the sainple leaves were 
periodically sprayed with the ethylene solution. Approximately 
24-48 hour post ethylene treatment, leaves were collected for 
RNA extraction. Another sub-sample was taken for metabolic 
constituent analysis to determine the concentration of leaf 
metabolites and more specific constituents of interest such as 
a variety of alkaloids. 

As an example, alkaloids analysis could be performed as 
follows. Samples {0.1 g) were shaken at 150 rpm with 0 . 5 ml 
2N NaOH, and a 5 ml extraction solution which contained 
quinoline as an internal standard and methyl t-butyl ether. 
Samples were analyzed on a HP 6890 GC eqcuipped with a FID 
detector. A temperature of 250 'C was used for the detector 
and injector. An HP column (30m-0 . 32nm'-l'm) consisting of 
fused silica crosslinked with 5% phenol and 95% methyl silicon 
was used at a temperature gradient of 110-185 at 10 'C per 
minute. The coliimn was operated at 100 'C with a flow rate of 
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1.7cm^inin-^ with a split ratio of 40:1 with a 2-1 injection 
volume using helium as the carrier gas. 

EXAMPLE 2 : BNA ISQLATIQKI 

For RNA extractions, middle leaves from 2 month old 
greenhouse grown plants were treated with ethylene as 
described. The 0 and 24-48 hours samples were used for RNA 
extraction. In some cases, leaf samples under the senescence 
process were taken from the plants 10 days post flower-head 
removal. These samples were also used for extraction. Total 
RNA was isolated using Rneasy Plant Mini Kit® (Qiagen, Inc., 
Valencia, California) following manufacturer's protocol. 

The tissue sample was ground under liquid nitrogen to a 
fine powder using a DEPC treated mortar and pestle. 
Approximately 100 milligrams of ground tissue were transferred 
to a sterile 1.5 ml eppendorf tube. This sample tube was 
placed in liquid nitrogen until all samples were collected. 
Then, 450jli-1 of Buffer RLT as provided in the kit (with the 
addition of Mercaptoethanol) was added to each individual 
tube. The sample was vortexed vigorously and incubated at 56° 
C for 3 minutes. The lysate was thei;i, applied to the 
QIAshredder^" spin column sitting in a 2 -ml collection tube, 
and centrifuged for 2 minutes at maximum speed. The flow 
through was collected and 0.5 volume of ethanol was added to 
the cleared lysate. The sample is mixed well and transferred 
to an Rneasy® mini spin column sitting in a 2 ml collection 
tube. The sample was centrifuged for 1 minute at 10,000rpm. 
Next, 700jLil of buffer RWl was pipetted onto the Rneasy® column 
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and centrifuged for 1 minute at 10,000rpm, Buffer RPE was 
pipetted onto the Rneasy® colunin in a new collection tube and 
centrifuged for 1 minute at 10,000 rpm. Buffer RPE was again, 
added to the Rneasy® spin column and centrifuged for 2 minutes 
at maximiim speed to dry the membrane. To eliminate any 
ethanol carry over, the membrane was placed in a separate 
collection tube and centrifuged for an additional 1 minute at 
maximum speed. The Rneasy® column was transferred into a new 
1.5 ml collection tube, and 40 jj.1 of Rnase-free water was 
pipetted directly onto the Rneasy® membrane. This final elute 
tube was centrifuged for 1 minute at 10,000rpm. Quality and 
quantity of total RNA was analyzed by denatured formaldehyde 
gel and spectrophotometer. 

Poly(A)RNA was isolated using Oligotex™ poly A+ RNA 
purification kit (Qiagen Inc.) following manufacture's 
protocol. About 200 p.g total RNA in 250 |ul maximum volume was 
used. A volume of 250ul of Buffer OBB and 15 pi of Oligotex*^" 
suspension was added to the 250 ul of total RNA. The contents 
were mixed thoroughly by pipetting and incubated for 3 minutes 
at 70 *C on a heating block. The sample was then, placed at 
room temperature for approximately 20 minutes- The 
oligotexrmKNA complex was pelleted by centrifugation for 2 
minutes at maximum speed. All but 50 ul of the supernatant 
was removed from the microcentrifuge tube. The sample was 
treated further by OBB buffer. The oligotextmRNA pellet was 
.resuspended in 400 \il of Buffer 0W2 by vortexing. This mix was 
transferred onto a small spin column placed in a new tube and 
centrifuged for 1 minute at maximiim speed. The spin column 
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was transferred to a new tube and an additional 400 ]il of 
Buffer 0W2 was added to the coliiinn. The tube was then 
centrifuged for 1 minute at maximum speed. The spin column 
was transferred to a final 1.5ml microcentrifuge tube.^ The 
sample was eluted with 60 ul of hot (70°C) Buffer OEB. Poly A 
product was analyzed by denatured formaldehyde gels and 
spectrophotometric analysis. 

EXAMPLE 3 : REVERSE TRANSCRIPT ION- PGR 

First strand cDNA was produced using Superscript 
reverse transcriptase following manufacturer's protocol 
(Invitrogen, Carlsbad, California) , The poly A+ enriched 
RNA/oligo dT primer mix consisted of less than 5 ]ig of total 
KNA, 1 ]xl of lOmM dMTP mix, 1 ul of Oligo d(T) 3.2^18 
(0.5ug/ul), and up to 10 ul of DEPC-treated water. Each 
sample was incubated at 65 'C for 5 minutes, then placed on 
ice for at least 1 minute. A reaction mixture was prepared 
by adding each of the following components in order: 2 ul 
lOX RT buffer, 4 ul of 25 mM MgC12, 2ul of 0.1 M DTT, and 1 
ul of RNase OUT Recombinant RNase Inhibitor. An addition of 
9 ul of reaction mixture was pipetted to each RNA/primer 
mixture and gently mixed. It was incubated at 42 'C for 2 
minutes and 1 ul of Super Script 11"^^ RT was added to each 
tube. The tube was incubated for 50 minutes at 42 'C. The 
reaction was terminated at 70" C for 15 minutes and chilled 
on ice. The sample was collected by centrif ugation and 1 ul 
of RNase H was added to each tube and incubated for 20 
minutes at 37 'C. The second PCR was carried out with 200 
pmoles of forward primer (degenerate primers as in Figure 
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75, SEQ.ID Nos. 149-156) and 100 pmoles reverse primer (mix 
of 18nt oligo d(T) followed by 1 random base) . 

Reaction conditions were 94 'C for 2 minutes and then 
performed 40 cycles of PGR at 94 "C for 1 minute, 45* to 60 'C 
for 2 minutes, 72 *C for 3 minutes with a 72*C extension for 
an extra 10 min. 

Ten microliters of the amplified sample were analyzed 
by electrophoresis using a 1% agarose gel. The correct size 
fragments were purified from agarose gel. 

EXAMPLE 4; GENERATION OF PGR FRAGMENT PQPULATIQNg 

PGR fragments from Example 3 were ligated into a pGEM- 
T® Easy Vector (Promega, Madison, Wisconsin) following 
manufacturer's instructions. The ligated product was 
transformed into JM109 competent cells and plated on LB 
media plates for blue/white selection. Colonies were 
selected and grown in a 95 well plate with 1.2 ml of LB 
media overnight at 37 'G. Frozen stock was generated for all 
selected colonies. Plasmid DNA from plates were purified 
using Beckman's Biomeck 2000 miniprep robotics with Wizard 
SV Miniprep® kit (Promega) . Plasmid DNA was eluted with 
lOOfiLwater and stored in a 96 well plate • Plasmids were 
digested by EcoRl and were analyzed using 1% agarose gel to 
confirm the DNA quantity and size of inserts. The plasmids 
containing a 400-600 bp insert were sequenced using an CEQ 
2000 sequencer (Beckman, Fullerton, California) . The 
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sequences were aligned with GenBank database by BLAST 
search. The p450 related fragments were identified and 
further analyzed. Alternatively, p45 0 fragments were 
isolated from substraction libraries. These fragments were 
also analyzed as described above. 

EXAMPLE 5 : CONSTRUCTION OF CDNA LIBRfiJlY 

A cDNA library was constructed by preparing total RNA 
from ethylene treated leaves as follows. First, total RNA 
was extracted from ethylene treated leaves of tobacco line 
58-33 using a modified acid phenol and chloroform extraction 
protocol. Protocol was modified to use one gram of tissue 
that was ground and subsequently vortexed in 5 ml of 
extraction buffer (100 mM Tris-HCl, pH 8.5; 200 raM NaCl; 
lOmM EDTA; 0.5% SDS) to which 5 ml phenol (pH5.5) and 5 ml 
chloroform was added. The extracted sample was centrifuged 
and the supernatant was saved. This extraction step was 
repeated 2-3 more times until the supernatant appeared 
clear. Approximately 5 ml of chloroform was added to remove 
trace amounts of phenol. RNA was precipitated from the 
combined supernatant fractions by adding a 3 -fold voliime of 
ETOH and 1/10 volume of 3M NaOAc (pH5.2) and storing at - 
20^0 for 1 hour. After transferring to a Corex glass 
container the RNA fraction was centrifuged at 9,000 RPM for 
45 minutes at 4^0. The pellet was washed with 7 0% ethanol 
and spun for 5 minutes at 9,000 RPM at 4^C. After drying 
the pellet, the pelleted RNA was dissolved in 0.5 ml RNase 
free water. The pelleted RNA was dissolved in 0.5 ml RNase 
free water. The quality and quantity of total RNA was 
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analyzed by denatured formaldehyde gel and 
spectrophotometer, respectively. 

The resultant total RNA was isolated for poly A+ RNA 
using an Oligo(dT) cellulose protocol (Invitrogen) and 
Microcentrifuge spin colximns (Invitrogen) by the following 
protocol. Approximately twenty mg of total RNA was 
subjected to twice purification to obtain high quality poly 
A+ RNA. Poly A+ RNA product was analyzed by performing 
denatured formaldehyde gel and subsequent RT-PCR of known 
full-length genes to ensure high quality of mRNA. 

Next, poly A-f RNA was used as template to produce a 
cDNA library employing cDNA synthesis kit, ZAP-cDNA® 
synthesis kit, and ZAP-cDNA® Gigapack® III gold cloning kit 
(Stratagene, La Jolla, California) . The method involved 
following the manufacture's protocol as specified. 
Approximately 8 \xg of poly A-«- RNA was used to construct cDNA 
library. Analysis of the primary library revealed about 2.5 
X 10^ - Ix 10*^ pfu. A quality background test of the 
library was completed by complementation assays using IPTG 
and x-gal, where recombinant plaques was expressed at more 
than 100-fold above the background reaction. 

A more quantitative analysis of the library by random 
PGR showed that average size of insert cDNA was 
approximately 1.2 kb. The method used a two-step PGR method 
as followed. For the first step, reverse primers were 
designed based on the preliminary sequence information 
obtained from p450 fragments. The designed reverse primers 
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and T3 (forward) primers were used amplify corresponding 
genes from the cDNA library. PGR reactions were subjected 
to agarose electrophoresis and the corresponding bands of 
high molecular weight were excised, purified, cloned and 
sequenced. In the second step, new primers designed from 
5'UTR or the start coding region of p450 as the forward 
primers together with the reverse primers (designed from 
3'UTR of p450) were used in the subsequent PGR to obtain 
full-length p450 clones. 

The p450 fragments were generated by PGR amplification 
from the constructed cDNA library as described in Example 3 
with the exception of the reverse primer. The T7 primer 
located on the plasmid downstream of cDNA inserts (see 
Figure 75) was used as a reverse primer. PGR fragments were 
Isolated, cloned and sequenced as described in Example 4. 

Full-length p4 50 genes were isolated by PGR method from 
constructed cDNA library. Gene specific reverse primers 
(designed from the downstream sequence of p45 0 fragments) 
and a forward primer (T3 on library plasmid) were used to 
clone the full length genes. PGR fragments were isolated, 
cloned and sequenced. If necessary, second step PGR was 
applied. In the second step, new forward primers designed 
from 5'UTR of cloned p450s together with the reverse primers 
designed from 3'UTR of p450 clones were used in the 
subsequent PGR reactions to obtain full-length p450 clones. 
The clones were subsequently sequenced. 
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EXAMPLE 6: CH ARACTERIZATION OF CLONED FRAGMENTS - REVERSE 
SOUTHERN BLOTTING ANALYSIS 

Nonradioactive lar^e scale reverse southern blotting 
assays were performed on all p450 clones identified in above 
examples to detect the differential expression. It was 
observed that the level of expression among different p45 0 
clusters was very different. Further real time detection 
was conducted on those with high expression. 

Nonradioactive Southern blotting procedures were 
conducted as follows. 

1) Total RNA was extracted from ethylene treated and 
nontreated converter (58-33) and nonconverter (58-25) leaves 
using the Qiagen Rnaeasy kit as described in Example 2. 

2) Probe was produced by biotin-tail labeling a single 
strand cDNA derived from poly A+ enriched RNA generated in 
above step. This labeled single strand cDNA was generated 
by RT-PCR of the converter and nonconverter total RNA 
(Invitrogen) as described in Example 3 with the exception of 
using biotinalyted oligo dT as a primer (Promega) , These 
were used as a probe to hybridize with cloned DNA. 

3) Plasmid DNA was digested with restriction enzyme 
EcoRl and run on agarose gels. Gels were simultaneously 
dried and transferred to two nylon membranes (Biodyne B®) . 
One membrane was hybridized with converter probe and the 
other with nonconverter probe. Membranes were UV- cross linked 
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(auto crosslink setting, 254 nm, Stratagene, Stratalinker ) 
before hybridization. 

Alternatively, the inserts were PGR amplified from each 
plasmid using the sequences located on both arms of p-GEM 
plasmid, T3 and SP6, as primers. The PGR products were 
analyzed by running on a 96 well Ready-to-run agarose gels. 
The confirmed inserts were dotted on two nylon membranes . 
One membrane was hybridized with converter probe and the 
other with nonconverter probe. 

4) The membranes were hybridized and washed following 
manufacture's instruction with the modification of washing 
stringency (Enzo MaxSence^^ kit, Enzo Diagnostics, Inc, 
Farmingdale, NY) . The membranes were prehybridized with 
hybridization buffer (2x SSC buffered fonnamide/ containing 
detergent and hybridization enhancers) at 42°G for 30 min 
and hybridized with lOfil denatured probe overnight at 42°C. 
The membranes then were washed in IX hybridization wash 
buffer 1 time at room temperature for 10 min and 4 times at 
68°C for 15 min. The membranes were ready for the 
detection. 

5) The washed membranes were detected by alkaline 
phosphatase labeling followed by NBT/BCIP colometric 
detection as described in manufacture's detection procedure 
(Enzo Diagnostics, Inc.) . The membranes were blocked for one 
hour at room temperature with Ix blocking solution, washed 3 
times with IX detection reagents for 10 min, washed 2 times 
with Ix predevelopment reaction buffer for 5 min and then 
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developed the blots in developing solution for 3 0-45 min 
until the dots appear. All reagents were provided by 
manufacture (Enzo Diagnostics, Inc) . In Addition, large 
scale reverse Southern assay was also performed using KPL 
southern hybridization and detection kit'^'^ following 
manfacturer' s instruction (KPL, Gaithersburg, Maryland). 

EXAMPLE 7: CHARACTERIZATION OF CLONES - NORTHERN BLOT 
ANALYSIS 

Alternative to Southern Blot analysis, some membranes 
were hybridized and detected as described in the example of 
Northern blotting assays. Northern Hybridization was used 
to detect mRNA differentially expressed in Nicotiana as 
follows . 

A random priming method was used to prepare probes from 
cloned p450 (Megaprime*" DNA Labelling Systems, Amersham 
Biosciences) . 

The following components were mixed: 2 5ng denatured DNA 
template; 4ul of each unlabeled dTTP, dGTP and dCTP; 5ul of 
reaction buffer; P^"-labelled dATP and 2ul of Klenow I; and 
H2O, to bring the reaction to 50ul. The mixture was 
incubated in 37<=>C for 1-4 hours, then stopped with 2]il of 
0.5 M EDTA. The probe was denatured by incubating at 95^C 
for 5 minutes before use. 
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RNA samples were prepared from ethylene treated and 
non- treated fresh leaves of several pairs of tobacco lines. 
In some cases poly A+ enriched RNA was used. Approximately 
ISug" total RNA or 1 . 8ug mRNA (methods of RNA and mRNA 
extraction as described in Example 5) were brought to equal 
volume with DEPC HgO (5-10 . The same volume of loading 
buffer (1 X MOPS; 18.5 % Formaldehyde; 50 % Formamide; 4 % 
Ficoll400; Bromophenolblue) and 0.5 ul EtBr (0.5 ug/yal) were 
added. The samples were subsequently denatured in 
preparation for separation of the RNA by electrophoresis. 

Samples were subjected to electrophoresis on a 
formaldehyde gel (1 % Agarose, 1 x MOPS, 0.6 M Formaldehyde) 
with IXMOP buffer (0.4 M Morpholinopropanesulf onic acid; 0.1 
M Na-acetate-3 x H20; 10 mM EDTA; adjust to pH 7.2 with 
NaOH) . RNA was transferred to a HybQnd-N+ membrane (Nylon, 
Amersham Pharmacia Biotech) by capillary method in 10 X SSC 
buffer (1.5 M NaCl; 0.15 M Na-citrate) for 24 hours. 
Membranes with RNA samples were UV-crosslinked (auto 
crosslink setting, 254 nm, Stratagene, Stratalinker) before 
hybridization. 

The membrane was prehybridized for 1-4 hours at 42°C 
with 5-10 ml prehybridization buffer (5 x SSC; 50 % 
Formamide; 5 x Denhardt ^ s-solution; 1 % SDS; lOOjig/ml heat- 
denatured sheared non- homologous DNA) . Old prehybridization 
buffer was discarded, and new prehybridization buffer and 
probe were added. The hybridization was carried out over 
night at 42°C. The membrane was washed for 15 minutes with 
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2 X SSC at room temperature, followed by a wash with 2 x 
SSC- 



A major focus of the invention was the discovery of 
novel genes that may be induced as a result of ethylene 
treatment or play a key role in tobacco leaf quality and 
constituents. As illustrated in the table below, Northern 
blots and reverse Southern Blot were useful in determining 
which genes were induced by ethylene treatment relative to 
non- induced plants. Interestingly, not all fragments were 
affected similarly in the converter and nonconverter . The 
cytochrome p450 fragments of interest were partially 
sequenced to determine their structural relatedness . This 
information was used to subsequently isolate and 
characterize full length gene clones of interest. 
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D56-AC7 (SEO ID No : 35) 
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D56-AG11 (SEO ID No: 31) 




D56-AC12 (SEO ID No : 45) 
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D70A-AB5 (SEO ID No : 95) 


+ 


D73-AC9 (SEO ID No: 43) 


+ 


D70A-'AA12 (SEO ID No : 131) 


+ 


D73A-AG3 (SEO ID No : 12 9) 


+ 


D34-52 (SEO ID No: 61) 


+ 


D55-AG6 (SEO ID No: 51) 


+ 



Northern analysis was performed using full length 
15 clones on tobacco tissue obtained from converter and 

nonconverter burley lines that were induced by ethylene 
treatment. The purpose was to identify those full length 
clones that showed elevated expression in ethylene induced 
converter lines relative to ethylene induced converter lines 
20 relative to ethylene induced nonconverter burley lines. By 

so doing, the functionality relationship of full length 
clones may be determined by comparing biochemical 
differences in leaf constituents between converter and 
nonconverter lines. As shown in table below, six clones 
25 showed significantly higher expression, as denoted by ++ and 

in converter ethylene treated tissue than that of 
nonconverter treated tissue, denoted by +. All of these 
clones showed little or no expression in converter and 
nonconverter lines that were not ethylene treated. 



30 


Full Length 


Converter 


Nonconverter 




Clones 
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D101-BA2 


++ 


+ 


D207-AA5 


++ 


+ 


D208-AC8 


+++ 




D237-AD1 


+ + 


+ 


D89-AB1 






D90A-BB3 


++ 





EXAMPLE 8: IMMUNODETECTION OF p450S ENCODED BY THE CLONED 
GENES 

Peptide regions corresponding to 20-22 amino acids in 
length from three p450 clones were selected for 1) having 
lower or no homology to other clones and 2) having good 
hydrophilicity and antigenicity. The amino acid sequences 
of the peptide regions selected from the respective p450 
clones are listed below. The synthesized peptides were 
conjugated with KHL and then injected into rabbits. 
Antisera were collected 2 and 4 weeks after the 4*"*" 
injection (Alpha Diagnostic Intl. Inc. San Antonio, TX) . 

D234-AD1 DIDGSKSKLVKAHRKIDEILG 
D9 0a-BB3 RDAFREKETFDENDVEELNY 
D8 9 - ABl FKNNGDEDRHFSQKLGDLADKY 

Antisera were examined for crossreactivity to target 
proteins from tobacco plant tissue by Western Blot analysis. 
Crude protein extracts were obtained from ethylene treated 
(0 to 4 0 hours) middle leaves of converter and nonconverter 
lines. Protein concentrations of the extracts were 
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determined using RC DC Protein Assay Kit (BIO-RAD) following 
the manufacturer's protocol. 

Two micrograms of protein were loaded onto each lane 
and the proteins separated on 10% - 20% gradient gels using 
the Laemmli SDS-PAGE system. The proteins were transferred 
from gels to PROTRANd) Nitrocellulose Transfer Membranes 
(Schleicher & Schuell) with the Trans-Blot® Semi-Dry cell 
(BIO-RAD) • Target p450 proteins were detected and 
visualized with the ECL Advance™ Western Blotting Detection 
Kit (Amersham Biosciences) . Primary antibodies against the 
synthetic-KLH conjugates were made in rabbits. Secondary 
antibody against rabbit IgG, coupled with peroxidase, was 
purchased from Sigma. Both primary and secondary antibodies 
were used at 1:1000 dilutions. Antibodies showed strong 
reactivity to a single band on the Western Blots indicating 
that the antisera were monospecific to the target peptide of 
interest. Antisera were also crossreactive with synthetic 
peptides conjuated to KLH. 

EXAMPLE 9; NUCLEIC ACID IDENTITY AND STRUCTURE RELATEDNESS 
OF ISOLATED NUCLEIC ACID FRAGMENTS 

Over 100 cloned p450 fragments were sequenced in 
conjunction with Northern blot analysis to determine their 
structural relatedness. The approach used utilized forward 
primers based either of two common p450 motifs located near 
the carboxyl- terminus of the p450 genes. The forward 
primers corresponded to cytochrome p450 motifs FXPERF or 
GRRXCP(A/G) as denoted in Figure 1. The reverse primers 
used standard primers from either the plasmid, SP6 or T7 
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located on both arms of pGEM"^" plasmid, or a poly A tail. 
The protocol used is described below. 

Spectrophotometry was used to estimate the 
concentration of starting double stranded DNA following the 
manufacturer's protocol (Beckman Coulter), The template was 
diluted with water to the appropriate concentration, 
denatured by heating at 95* C for 2 minutes, and 
subsequently placed on ice. The sequencing reaction was 
prepared on ice using 0.5 to lOpl of denatured DNA template, 
2 pi of 1.6 pmole of the forward primer, 8 \il of DTCS Quick 
Start Master Mix and the total volume brought to 20 pi with 
water. The thermocycling program consisted of 30 cycles of 
the follow cycle: 95* C for 20 seconds, 50" C for 20 
seconds, and 60* C for 4 minutes followed by holding at 4' 
C. 

The sequence was stopped by adding 5 pi of stop buffer 
(equal volume of 3M NaOAc and lOOmM EDTA and 1 pi of 2 0 
mg/ml glycogen) . The sample was precipitated with 60 pi of 
cold 95% ethanol and centrifuged at 6000g for 6 minutes. 
Ethanol was discarded. The pellet was 2 washes with 200 pi 
of cold 70% ethanol. After the pellet was dry, 40 pi of SLS 
solution was added and the pellet was resuspended. A layer 
of mineral oil was over laid. The sample was then, placed 
on the CEQ 8000 Automated Sequencer for further analysis. 

In order to verify nucleic acid sequences, nucleic acid 
sequence was re-sequenced in both directions using forward 
primers to the FXPERF or GRRXCP(A/G) region of the p450 gene 
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or reverse primers to either the plasmid or poly A tail. All 
sequencing was performed at least twice in both directions. 

The nucleic acid sequences of cytochrome p450 fragments 
were compared to each other from the coding region 
corresponding to the first nucleic acid after the region 
encoding the GRRXCP(A/G) motif through to the stop codon. 
This region was selected as an indicator of genetic 
diversity among p450 proteins. A large number of 
genetically distinct p450 genes, in excess of 70 genes, were 
observed, similar to that of other plant species. Upon 
comparison of nucleic acid sequences, it was found that the 
genes could be placed into distinct sequences groups based 
on their sequence identity. It was found that the best 
unique grouping of p450 members was determined to be those 
sequences with 75% nucleic acid identity or greater (shown 
in Table I) . Reducing the percentage identity resulted in 
significantly larger groups. A preferred grouping was 
observed for those sequences with 81% nucleic acid identity 
or greater, a more preferred grouping 91% nucleic acid 
identity or greater, and a most preferred grouping for those 
sequences 99% nucleic acid identity of greater. Most of the 
groups contained at least two members and frequently three 
or more members. Others were not repeatedly discovered 
suggesting that approach taken was able to isolated both low 
and high expressing mRHA in the tissue used. 

Based on 75% nucleic acid identity or greater, two 
cytochrome p450 groups were found to contain nucleic acid 
sequence identity to previously tobacco cytochrome genes 
that genetically distinct from that within the group. Group 
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23, showed nucleic acid identity, within the parameters used 
for Table I, to prior GenBank sequences of GI: 1171579 

(CAA64635) and GI: 14423327 (or AAK62346) by Czernic et al 
and Ralston et al, respectively. GI; 1171579 had nucleic 
acid identity to Group 23 members ranging 96.9% to 99.5% 
identity to members of Group 23 while GI: 14423327 ranged 
95.4% to 96.9% identity to this group. The members of Group 
31 had nucleic acid identity ranging from 76.7% to 97.8% 
identity to the GenBank reported sequence of GI; 14423319 

(AAK62342) by Ralston et al . None of the other p450 
identity groups of Table 1 contained parameter identity, as 
used in Table 1, to Nicotiana p450s genes reported by 
Ralston et al, Czernic et al., Wang et al or LaRosa and 
Smigocki . 

As shown in Figure 76, consensus sequence with 
appropriate nucleic acid degenerate probes could be derived 
for group to preferentially identify and isolate additional 
members of each group from Nicotiana plants. 
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Table I: JSTic otiana td45Q Nucleic Acid Secmence Identity 
Groups 

GROUP FRAGMENTS 

1 D58-BG7 (SEQ ID No . : 1) , D58-AB1 (SEQ ID No.:3); D58~BE4 
( SEQ ID No . : 7 ) 

2 D56-AH7 {SEQ ID No . : 9 ) ; Dl3a-5 (SEQ ID No.: 11) 

3 D56-AG10 {SEQ ID No.:13); D35-33 (SEQ ID No.:15); 
D34-62 (SEQ ID No.;17) 

4 D56-AA7 (SEQ ID No.:19); D56-AE1 (SEQ ID No.:21); 
185-BD3 (SEQ IDNo.:143) 

5 D35-BB7 (SEQ ID No,:23); D177-BA7 (SEQ ID No.:25); 
D56A-AB6 (SEQ ID No.:27); D144-AE2 (SEQ ID No.:29) 

6 D56-AG11 (SEQ ID No.:31); D179-AA1 (SEQ ID No.:33) 

7 D55-AC7 (SEQ ID No.:35); D144-AD1 (SEQ ID No.;37) 

8 D144-AB5 (SEQ ID No.:39) 

9 D181-AB5 (SEQ ID No.:41); D73-Ac9 (SEQ ID No.:43) 

10 D56-AC12 (SEQ ID No.:45) 

11 D58-AB9 (SEQ ID No . : 47 ) ; D56-AG9 (SEQ ID No-:49); 
D56-AG6 (SEQ IDNo.:51); D35-BG11 (SEQ IDNo.:53); D35-42 
(SEQ ID No.;55); D35-BA3 (SEQ ID No.:57); D34-57 (SEQ ID 

No.:59); D34-52 (SEQ IDNo.:61); D34-25 (SEQ IDNo.:63) 

12 D56-AD10 (SEQ ID No.:65) 

13 55-AAll (SEQ ID No.:67) 

14 D177-BD5 (SEQ ID No,:59); D177-BD7 (SEQ ID No.:83) 
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15 D56A-AG10 (SEQ IDNo.:71); D58-BC5 (SEQ IDNo.:73); 
D58-AD12 (SEQ ID No.:75) 

16 D56-AC11 (SEQ ID No.:77); D35-39 (SEQ ID No.:79); 
D58-BH4 (SEQ ID No.:81); D56-AD5 (SEQ ID No.:87) 

17 D73A-AD6 (SEQ IDNo.:89); D70A-BA11 (SEQ IDNo.:91) 

18 D70A-AB5 (SEQ IDNo.:95); D70A-AA8 (SEQ IDNo.:97) 

19 D70A-AB8 (SEQ IDNo.:99); D70A-BH2 (SEQ IDNo.:101); 
D70A-AA4 (SEQ ID No.:103) 

20 D70A-BA1 (SEQ IDNo.:105); D70A-BA9 (SEQ IDNo.:107) 

21 D70A-BD4 (SEQ ID No .: 109 ) 

22 D181-AC5 (SEQ IDNo.:lll); D144-AH1 (SEQ ID]SIO.:113); 
D34-65 (SEQ IDNo.:115) 

23 D35-BG2 ( SEQ ID No . : 117 ) 

24 D73A-AH7 (SEQ IDNo.:119) 

25 D58-AA1 (SEQ IDNo.:121); D185-BC1 (SEQ IDNo.:133); 
D185-BG2 (SEQ IDNo.:135) 

26 D73-AE10 (SEQ ID No.:123) 

27 D56-AC12 (SEQ IDNo.:125) 

28 D177-BF7 (SEQ IDNo.:127); D185-BE1 (SEQ IDNo.:137); 
D185-BD2 (SEQ IDNo.:139) 

29 D73A-AG3 (SEQ IDNo.:129) 

30 D70A-AA12 (SEQ ID No.:131); D176-BF2 (SEQ ID No.:85) 

31 D176-BC3 (SEQ ID No.:145) 

32 D176-BB3 (SEQ ID No.: 147) 

33 D186-AH4 (SEQ ID No.:5) 
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IgXAMPLE IQ: RELATED AMINO ACID SEQUENCE IDENTITY OF 
ISOLATED NUCLEIC ACID FRAGMENTS 

5 The amino acid sequences of nucleic acid sequences 

obtained for cytochrome p450 fragments from Example 8 were 
deduced. The deduced region corresponded to the amino acid 
immediately after the GXRXCP(A/G) sequence motif to the end 
of the carboxyl- terminus, or stop codon. Upon comparison of 

10 sequence identity of the fragments, a unique grouping was 

observed for those sequences with 70% amino acid identity or 
greater. A preferred grouping was observed for those 
sequences with 80% amino acid identity or greater, more 
preferred with 90% amino acid identity or greater, and a 

15 most preferred grouping for those sequences 99% amino acid 

identity of greater. The groups and corresponding amino 
acid sequences of group members are shown in Figure 2 , 
Several of the unique nucleic acid sequences were found to 
have complete amino acid identity to other fragments and 

20 therefore only one member with the identical amino acid was 

reported. 

The amino acid identity for Group 19 of Table II 
corresponded to three distinct groups based on their nucleic 
25 acid sequences. The amino acid sequences of each group 

member and their identity is shown in Figure. 77. The amino 
acid differences are appropriated marked. 

At least one member of each amino acid identity group 
30 was selected for gene cloning and functional studies using 

plants. In addition, group members that are differentially 
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affected by ethylene treatment or other biological 
differences as assessed by Northern and Southern analysis 
were selected for gene cloning and functional studies. To 
assist in gene cloning, expression studies and whole plant 
5 evaluations, peptide specific antibodies will be prepared on 

secjuence identity and differential sequence. 
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Table II; Nicotiana p45Q Amino Acid Sequence Identity Groups 

GROUP FRAGMENTS 

1 D58-BG7 (SEQ ID No. : 2 ) , D58-AB1 (SEQ ID No,:4) 

2 D58-BE4 (SEQ IDNo.:8) 

3 D55-AH7 (SEQ ID No.:10); D13a-5 (SEQ ID No.:12) 

4 D56-AG10 (SEQ ID 
No.:14); D34-62 (SEQ IDNo,:18) 

5 D56-AA7 (SEQ ID NO-:20); D56-AE1 (SEQ ID ]SIO.:22); 185- 
BD3 (SEQ ID No. :144) 

6 D35-BB7 (SEQ ID No.:24); D177-BA7 (SEQ ID No.:26); 
D56A-AB6 (SEQ ID No.:28); D144~AE2 (SEQ ID No.:30) 

7 D56-AG11 (SEQ ID No.:32); D179-AA1 (SEQ ID No,:34) 

8 D56-AC7 (SEQ ID No.:36); D144-AD1 (SEQ ID No.:38) 

9 D144-AB5 (SEQ ID No.:40) 

10 D181-AB5 (SEQ ID No.: 42); D73~Ac9 (SEQ ID No.: 44) 

11 D56-AC12 (SEQ ID No.:45) 

12 D58-AB9 (SEQ ID No.:48); D56-AG9 (SEQ ID No. :50); D56- 
AG6 (SEQ ID No.:52); D35-BG11 (SEQ ID No.:54); D35-42 (SEQ 
ID No.:56); D35-BA3 (SEQ ID No.:58); D34-57 (SEQ ID 
No.:60); D34-52 (SEQ IDNo.:62) 

13 D56AD10 (SEQ ID No.:66) 

14 56-AAll (SEQ IDNo.;68) 

15 D177-~BD5 (SEQ ID No.:70); D177-BD7 (SEQ ID No,:84) 
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16 D56A-AG10 (SEQ ID No.:72); D58-BC5 (SEQ ID No,:74); 
D58-AD12 (SEQ ID No.:76) 

17 D56-AC11 (SEQ IDNo.:78); D56-AD6 (SEQ IDNo,:88) 

18 D73A-AD6 (SEQ ID No. 90:) 

19 D70A-AB5 (SEQ IDNo.:96); D70A-AB8 (SEQ IDNo.:100); 
D70A-BH2 (SEQ ID No.: 102); D70A-AA4 (SEQ ID No.: 104); D70A- 
BAl (SEQ ID No.:106); D70A-BA9 (SEQ ID No.:108) 

20 D70A-BD4 (SEQ IDNo.:110) 

21 D181-AC5 (SEQ IDNo.:112); D144-AH1 (SEQ IDNo.:114); 
D34-65 (SEQ ID No.: 116) 



22 


D35-BG2 (SEQ ID No.: 


118) 






23 


D73A-AH7 (SEQ ID No. 


:120) 






24 


D58-AA1 (SEQ ID No . : 


122) ; 


D185-BC1 (SEQ ID No.: 


134) ; 


D185 


-BG2 (SEQ IDNo.:136) 








25 


D73-AE10 (SEQ ID No. 


:124) 






26 


D56-AC12 (SEQ ID No. 


:126) 






27 


D177-BF7 (SEQ ID No. 


:128) ; 


185-BD2 (SEQ ID No.: 


140) 


28 


D73A-AG3 (SEQ ID No. 


:130) 






29 


D7 0A-AA12 (SEQ ID No 


. :132) 


; D176-BF2 (SEQ ID No 


. :86) 


30 


D176-BC3 (SEQ ID No. 


:146) 






31 


D176-BB3 (SEQ ID No. 


:148) 






32 


D186-AH4 (SEQ ID No. 


:6) 
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EXAMPLE 11; R ELATED AMINO ACID SEQUENCE IDENTITY OF FULL 
LENGTH CLONES 

The nucleic acid sequence of full length Nicotiana 
genes cloned in Example 5 were deduced for their entire 
amino acid sequence. Cytochrome p450 genes were identified 
by the presence of three conserved p450 domain motifs, which 
corresponded to UXXRXXZ, PXRFXF or GXRXC at the carboxyl- 
terminus where U is E or K, X is any amino acid and Z is P, 
T, S or M. It was also noted that two of the clones 
appeared nearly complete but lacked the appropriate stop 
codon, D130-AA1 and D101-BA2, however but both contained all 
three p450 cytochrome domains. All p450 genes were 
characterized for amino acid identity using a BLAST program 
comparing their full length sequences to each other and to 
known tobacco genes. The program used the NCBI special 
BLAST tool (Align two sequences (bl2seq) , 

http ; / /www , ncbi . nlm . nih . gov/blas t /bl2 sea/bl2 . html ) . Two 
sequences were aligned under BLASTN without filter for 
nucleic acid sequences and BLASTP for amino acid sequences. 
Based on their percentage amino acid identity, each sequence 
was grouped into identity groups where the grouping 
contained members that shared at least 85% identity with 
another member. A preferred grouping was observed for those 
sequences with 9 0% amino acid identity or greater, a more 
preferred grouping had 95% amino acid identity or greater, 
and a most preferred grouping had those sequences 99% amino 
acid identity or greater. Using these criteria, 25 unique 
groups were identified and are depicted in Table III. 

Within the parameters used for Table III for amino acid 
identity, three groups were found to contain greater than 
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85% or greater identity to known tobacco genes. Members of 
Group 5 had up to 96% amino acid identity for full length 
sequences to prior GenBank sequences of GI: 14423327 (or 
AAK62346) by -Ralston et al . Group 23 had up to 93% amino 
acid identity to GI: 14423328 (or AAK62347) by Ralston et al. 
and Group 24 had 92% identity to GI: 14423318 (or AAK52343> 
by Ralston et al. 

Table III: Amino Acid Secfuence Identity Groups of Full 
Length Hicotiana p450 Genes 



1 D208-AD9 (SEQ. ID. No. 224); D120-AH4 (SEQ. ID. No. 
180); D121-AA8 (SEQ. ID. No. 182), D122-AF10 (SEQ. ID. 
No, 184); D103-AH3 (SEQ. ID. No. 222); D208-AC8 (SEQ. 
ID. No. 218); D-235-ABI (SEQ. ID. No. 246) 

2 D244-AD4 (SEQ. ID, No. 250); D244-AB6 (SEQ. ID. No. 
274) ; D285-AA8; D285-AB9; D268-AE2 (SEQ. ID. No. 270) 

3 D100A-AC3 (SEQ. ID. No. 168); D100A-BE2 

4 D205-BE9 (SEQ, ID. No. 276); D205-BG9 (SEQ. ID. No. 
202); D205-AH4 (SEQ. ID. No. 294) 

5 D259-AB9 (SEQ. ID. No. 260) ; D257-AE4 (SEQ. ID. No. 
268); D147-AD3 (SEQ. ID. No. 194) 

6 D249-AE8 (SEQ. ID. No, 256); D-248~AA6 (SEQ. ID. No. 
254) 

7 D233-AG7 (SEQ. ID. No. 266; D224-BD11 (SEQ. ID. No. 
240) ; DAFIO 

8 D105-AD6 (SEQ. ID. No. 172); D215-AB5 (SEQ. ID. No. 
220); D135-AE1 (SEQ. ID. No. 190) 

9 D87A-AF3 (SEQ. ID. No. 216), D210-BD4 (SEQ. ID, No. 
262) 
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10 D89-AB1 (SEQ. ID. No. 150); D89-AD2 (SEQ. ID. No. 152); 
163-AGll (SEQ. ID. No. 198); 163-AF12 (SEQ. ID. No. 

196) 

11 D267-AF10 (SEQ. ID. No. 296); D96-AC2 (SEQ. ID. No. 

5 160); D96-AB6 (SEQ. ID. No. 158); D207-AA5 (SEQ. ID. 

NO. 204); D207-AB4 (SEQ. ID. No. 206); D207-AC4 (SEQ. 
ID. No. 208) 

12 D98-AG1 (SEQ. ID. No. 164); D98-AA1 (SEQ. ID. No. 162) 

13 D209-AA12 (SEQ. ID. No. 212); D209-AA11; D209-AH10 
10 (SEQ. ID. NO. 214); D209-AH12 (SEQ. ID. NO- 232); 

D90a-BB3 (SEQ. ID. No. 154) 

14 D129-AD10 (SEQ. ID. No. 188); D104A-AE8 (SEQ. ID. No. 
170) 

15 D228-AH8 (SEQ. ID. No. 244); D228-AD7 (SEQ. ID. No. 
15 241), D250-AC11 (SEQ. ID. No. 258); D247-AH1 (SEQ. 

ID. No. 252) 

16 D128-AB7 (SEQ. ID. No. 186) ; D243-AA2 (SEQ. ID. No. 
248); D125-AF11 (SEQ. ID. No. 228) 

17 D284-AH5 (SEQ. ID. No. 298); D110-AF12 (SEQ. ID. No. 
20 176) 

18 D221-BB8 (SEQ. ID. No. 234) 

19 D222-BH4 (SEQ. ID. No. 236) 

20 D134-AE11 (SEQ. ID. No. 230) 

21 D109-AH8 (SEQ. ID. No. 174) 
25 22 D136-AF4 (SEQ. ID. No. 278) 

23 D237-AD1 (SEQ. ID. No. 226) 

24 D112-AA5 (SEQ. ID. No. 178) 

25 D283-AC1 (SEQ. ID. No. 272) 

30 The full length genes were further grouped based on the 

highly conversed amino acid homology between UXXRXXZ p450 
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domain and GXRXC p450 domain near the end the carboxyl- 
tentiinus. As shown in Figure 3, individual clones were 
aligned for their sequence homology between the conserved 
"domains relative to each other and placed in distinct 
identity groups. In several cases, although the nucleic 
acid sequence of the clone was unique, the amino acid 
sequence for the region was identical. The preferred 
grouping was observed for those sequences with 90% amino 
acid identity or greater, a more preferred group had 95% 
amino acid identity or greater, and a most preferred 
grouping had those sequences 99% amino acid identity of 
greater. The final grouping was similar to that based on 
the percent identity for the entire amino acid sequence of 
the clones except for Group 17 (of Table III) which was 
divided into two distinct groups. 

Within the parameters used for amino acid identity in 
Table IV, three groups were found to contain 90% or greater 
identity to known tobacco genes. Members of Group 5 had up 
to 93.4% amino acid identity for full length sequences to 
prior GenBank sequences of GI: 14423326 (AAK62346) by Ralston 
et al. Group 23 had up to 91.8% amino acid identity to 
GI: 14423328 (or AAK62347) by Ralston et al . and Group 24 had 
98.8% identity to GI; 14423318 (or AAK62342) by Ralston et 
al. 



65 



wo 2005/038033 



PCT/US2004/034065 



'Pahle IV: Amino Acid Sequence Identity Groups of Regions 
between Conserved Domains of Nicotiana p45Q Genes 

1 1 D208-AD9 (SEQ. ID. No. 224); D120-AH4 (SEQ. ID. No. 

180); D121-AA8 (SEQ. ID. No. 182), D122-AF1G (SEQ. ID. 
No. 184); D103-AH3 (SEQ. ID. No. 222); D208-AC8 (SEQ. 
ID. No. 218); D-235-ABI (SEQ. ID. No. 246) 

2 D244-AD4 (SEQ. ID. No. 250); D244-AB6 (SEQ. ID. No. 
274) ; D285-AA8; D285-AB9; D268-AE2 (SEQ. ID. No. 270) 

3 D100A-AC3 (SEQ. ID. No. 168); D100A-BE2 

4 D205-BE9 (SEQ. ID. No. 276); D205-BG9 (SEQ. ID. No. 
202); D205-AH4 (SEQ. ID. No. 294) 

5 D259-AB9 (SEQ. ID. No. 260) ; D257-AE4 (SEQ. ID. No. 
268); D147-AD3 (SEQ. ID. No. 194) 

6 D249-AE8 (SEQ. ID. No. 256); D-248-AA6 (SEQ. ID. No. 
254) 

7 D233-AG7 (SEQ. ID. No. 266; D224-BD11 (SEQ. ID. No. 
240) ; DAFIO 

8 D105-AD6 (SEQ. ID. No. 172); D215-AB5 (SEQ. ID. No. 
220); D135-AE1 (SEQ. ID. No. 190) 

9 D87A-AF3 (SEQ. ID. No. 216), D210-BD4 (SEQ. ID. No. 

262) 

10 D89-AB1 (SEQ. ID. No. 150); D89-AD2 (SEQ. ID. No. 152); 
163-AGll (SEQ. ID. No. 198); 163-AF12 (SEQ. ID. No. 
196) 

11 D267-AF10 (SEQ. ID. No. 296); D96-AC2 (SEQ. ID. No. 
160); D96-AB6 (SEQ. ID. No. 158); D207-AA5 (SEQ. ID. 
No. 204); D207-AB4 (SEQ. ID. No. 206); D207-AC4 (SEQ. 
ID. No. 208) 

12 D98-AG1 (SEQ. ID. No. 164); D98-AA1 (SEQ. ID. No. 162) 
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13 


D209-AA12 


(SEQ. ID. No. 212) ; 


D209-AA11; 


D209-AH10 






(SEQ. ID. 


No. 214) ; D209~AH12 


(SEQ. ID. 


No. 232); 






D90a-BB3 


(SEQ. ID. No. 154) 








14 


D129-AD10 


(SEQ. ID. No. 188) 


; D104A-AE8 


(SEQ, ID. No. 


5 




170) 










15 


D228~AH8 


(SEQ- ID. No. 244) ; 


D228-AD7 


(SEQ. ID. No. 






241), D250-AC11 (SEQ. ID. No 


, 258) ; D247-AH1 (SEQ. 






ID, No. 252) 








16 


D128-AB7 


(SEQ. ID. No. 186) ; 


D243-AA2 


(SEQ. ID. No. 


10 




248); D125-AF11 (SEQ. ID. No 


. 228) 






17 


D284-AH5 


(SEQ. ID. No. 298) ; 


D110-AF12 


(SEQ- ID. No. 






176) 










18 


D221-BB8 


(SEQ. ID. No. 234) 








19 


D222-BH4 


(SEQ. ID. No. 236) 






15 


20 


D134-AE11 


(SEQ- ID, No. 230) 








21 


D109-AH8 


(SEQ. ID. No. 174) 








22 


D136-AF4 


(SEQ. ID. No. 278) 








23 


D237-AD1 


(SEQ. ID. No. 226) 








24 


D112-AA5 


(SEQ. ID. No. 178) 






20 


25 


D283-AC1 


(SEQ. ID. No. 272) 








26 


D110-AF12 


(SEQ. ID. No. 176) 







EXAMPLE 12: NICOTIANA CYTOCHROME P450 CLONES LACKING ONE OR 
MORE OF THE TOBACCO CYTOCHROME P450 SPECIFIC DOMAINS 

25 

Four clones had high nucleic acid homology, ranging -90% 
to 99% nucleic acid homology, to other tobacco cytochrome 
genes reported in Table III. The four clones included D136- 
AD5, D138-AD12, D243-AB3 and D250-AC11. However, due to a 
30 nucleotide frameshift these genes did not contain one or 

more of three C- terminus cytochrome p450 domains and were 
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excluded from identity groups presented in Table III or 
Table IV. 

The amino acid identity of one clone, D95-AG1, did not 
contain the third domain, GXRXC, used to group p450 tobacco 
genes in Table III or Table IV. The nucleic acid homology 
of this clone had low homology to other tobacco cytochrome 
genes. This clone represents a novel and different group of 
cytochrome p450 genes in Nicotiana. 

EXAMPLE 13: USE OF NICOTIANA CYTOCHROME P45Q FRAGMENTS A3SID 
CLONES IN ALTERED REGULARTION OF TOBACCO PROPERTIES 

The use of tobacco p450 nucleic acid fragments or whole 
genes are useful in identifying and selecting those plants 
that have altered tobacco phenotypes or tobacco constituents 
and, more importantly, altered metabolites. Transgenic 
tobacco plants are generated by a variety of transformation 
systems that incorporate nucleic acid fragments or full 
length genes, selected from those reported herein, in 
orientations for either down- regulation, for example anti- 
sense orientation, or over-expression for example, sense 
orienation. For over- express ion to full length genes, any 
nucleic acid sequence that encodes the entire or a 
functional part or amino acide sequence of the full-length 
genes described in this invention are desired that are 
effective for increasing the expression of a certain enzyme 
and thus resulting in phenotypic effect within Nicotiana. 
Nicotiana lines that are homozygous lines are obtained 
through a series of backcrossing and assessed for phenotypic 
changes including, but not limited to, analysis of 
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endogenous p450 UNA, transcripts, p450 expressed peptides 
and concentrations of plant metabolites using techniques 
commonly avaiable to one having ordinary skill in the art. 
The changes exhibited in the tobacco plans provide 
information on the functional role of the selected gene of 
interest or are of a utility as a preffered Nicotiana plant 
species , 

EXAMPLE 14, Identification of genes INDUCED IN ETHYLENE TREATED 
CONVERTER LINES 

High density oligonucleotide array technology, Af fymetrix 
GeneChip® (Af fymetrix Inc., Santa Clara, CA) array, was used 
for quantitative and highly parallel measurements of gene 
expression. In using this technology, nucleic acid arrays 
were fabricated by direct synthesis of oligonucleotides on a 
solid surface. This solid-phase chemistry is able to produce 
arrays containing hundreds of thousands of oligonucleotide 
probes packed at extremely high densities on a chip referred 
to as GeneChip®. Thousands of genes can be simultaneously 
screened from a single hybridization. Each gene is typically 
represented by a set of 11-25 pairs of probes depending upon 
size. The probes are designed to maximize sensitivity, 
specificity, and reproducibility, allowing consistent 
discrimination between specific and background signals, and 
between closely related target sequences. 



69 



wo 2005/038033 



PCT/US2004/034065 



Af fymetrix GeneChip hybridization experiments involve the 
following steps: design and production of arrays, preparation 
of f luorescently labeled target from RNA isolated from the 
biological specimens, hybridization of the labeled target to 
the GeneChip, screening the array, and analysis of the scanned 
image and generation of gene expression profiles. 

A, Designing and Custom making Af fymetrix GeneChip 

A GeneChip CustomExpress Advantage Array was custom 
made by Af fymetrix Inc. (Santa Clara, CA) . Chip size was 18 
micron and array format was 100-2187 that can accommodate 
528 probe sets (11, 628 probes) . Except for GenBank 
derived nucleic acid sequences, all sequences were selected 
from our previously identified tobacco clones and all probes 
were custom designed. A total of 400 tobacco genes or 
fragments were selected to be included on the GeneChip. The 
sequences of oligonucleotides selected were based on unique 
regions of the 3' end of the gene. The selected nucleic 
acid sequences consisted of 56 full length p450 genes and 71 
p4 50 fragments that were cloned from tobacco, described in 
(patent applications) . Other tobacco sequences included270 
tobacco ESTs which were generated from suppression 
subtraction library using Clontech SSH kit(BD Biosciences, 
Palo Alto, CA) . Among these genes, some oligonucleotide 
sequences were selected from cytochrome P450 genes listed in 
GenBank. Up to 25 probes were used for each full length gene 
and 11 probes for each fragment. A reduced number of probes 
were used for some clones due to the lack of unique, high 
quality probes. Appropriate control sequences were also 
included on the GeneChip®. 



70 



wo 2005/038033 



PCT/US2004/034065 



The probe Arrays were 25-mer oligonucleotides that were 
directly synthesized onto a glass wafer by a coinbination of 
semiconductor-based photolithography and solid phase 
chemical synthesis technologies. Each array contained up to 
100,000 different oligonucleotide probes. Since 
oligonucleotide probes are synthesized in known locations on 
the array, the hybridization patterns and signal intensities 
can be interpreted in terms of gene identity and relative 
expression levels by the Af fymetrix Microarray Suite® 
software. Each probe pair consists of a perfect match 
oligonucleotide and a mismatch oligonucleotide. The perfect 
match probe has a sequence exactly complimentary to the 
particular gene and thus measures the expression of the 
gene. The mismatch probe differs from the perfect match 
probe by a single base substitution at the center base 
position, which disturbs the binding of the target gene 
transcript. The mismatch produces a nonspecific 
hybridisation signal or background signal that was compared 
to the signal measured for the perfect match 
oligonucleotide . 

B. Sample preparation 

Hybridization experiments were conducted by Genome 
Explorations, Inc. (Memphis, TN) . The RNA samples used in 
hybridization consisted of six pairs of nonconverter/ converter 
isogenic lines that were induced by ethylene treatments. 
Samples included one pair of 4407-25/4407-33 non- treated burly 
tobacco samples, three pairs of ethylene treated 4407-25/4407- 
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33 samples, one pair of ethylene treated dark tobacco NL 
Madole/181 and one pair of ethylene treated burly variety 
PBLBOl/178, Ethylene treatment was as described in Example 1. 

Total RNA was extracted from above mentioned ethylene' 
treated and non- treated leaves using a modified acid phenol 
and chloroform extraction protocol. Protocol was modified to 
use one gram of tissue that was ground and subsequently 
vortexed in 5 ml of extraction buffer (100 mM Tris-HCl, pH 
8.5; 200 mM NaCl; lOmM EDTA; 0.5% SDS) to which 5 ml phenol 
(pH5.5) and 5 ml chloroform was added. The extracted sample 
was centrifuged and the supernatant was saved. This 
extraction step was repeated 2-3 more times until the 
supernatant appeared clear. Approximately 5 ml of chloroform 
was added to remove trace amounts of phenol. KNA was 
precipitated from the combined supernatant fractions by adding 
a 3-fold volume of ETOH and 1/10 volume of 3M NaOAc {pH5.2) 
and storing at -20^0 for 1 hour. After transferring to a 
Corex glass container the RNA fraction was centrifuged at 
9,000 RPM for 45 minutes at 4^0, The pellet was washed with 
70% ethanol and spun for 5 minutes at 9,000 RPM at 4^0. After 
drying the pellet, the pelleted RNA was dissolved in 0 . 5 ml 
RNase free water. The pelleted RNA was dissolved in 0.5 ml 
KNase free water. The quality and quantity of total RNA was 
analyzed by denatured formaldehyde gel and spectrophotometer, 
respectively. The total KDTA samples with 3-5|ig/ul were sent 
to Genome explorations, inc. to do the hybridization. 

C. Hybridization, detection and data output 
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The preparation of labeled cRNA material was performed as 
follows. First and second strand cDNA were synthesized from 5- 
15 ug of total RJSJA using the Superscript Double- Stranded cDNA 
Synthesis Kit (Gibco Life Technologies) and oligo-dT24~T7 (5'- 
GGC CAG TGA ATT GTA ATA CGA CTC ACT ATA GGG AGG CGG-3 ' ) primer 
according to the manufacturer's instructions. 

The cRNA was concurrently synthesized and labeled with 
bio tiny lated UTP and CTP by in vitro transcription using the 
T7 promoter coupled double stranded cDNA as template and the 
T7 RNA Transcript Labeling Kit (ENZO Diagnostics Inc.). 
Briefly, double stranded cDNA synthesized from the previous 
steps were washed twice with 70% ethanol and resuspended in 22 
\il Rnase-free H20. The cDNA was incubated with 4 ]il of lOX 
each Reaction Buffer, Bio tin Labeled Ribonucleotides, DTT, 
Rnase Inhibitor Mix and 2 \il 20X T7 RNA Polymerase for 5 hr at 
372c. The labeled cRNA was separated from unincorporated 
ribonucleotides by passing through a CHROMA SPIN--100 column 
(Clontech) and precipitated at -20^0 for 1 hr to overnight. 

Oligonucleotide array hybridization and analysis were 
performed as follows. The cRNA pellet was resuspended in 10 
lal Rnase-free H20 and 10.0 ]ig was fragmented by heat and ion- 
mediated hydrolysis at 95^0 for 35 mins in 2 00 mM Tris- 
acetate, pH 8.1, 5 00 mM KOAc, 150 mM MgOAc. The fragmented 
CRNA was hybridized for 16hr at 45^C to HG_U95Av2 
oligonucleotide arrays (Affymetrix) containing -12,500 full 
length annotated genes together with additional probe sets 
designed to represent EST sequences. Arrays were washed at 
252c with 6 X SSPE (0.9M NaCl, 60 mMNaH2P04, 6 mM EDTA 0.01% 
Tween 20) followed by a stringent wash at 50«C with 100 mM 
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MES, O.IM [Na-h], 0.01% Tween 20. The arrays were stained with 
phycoerythrein conjugated streptavidin (Molecular Probes) and 
the fluorescence intensities were determined using a laser 
confocal scanner (Hewlett-Packard) . The scanned images were 
analyzed using Microarray software (Af fymetrix) . Sample 
loading and variations in staining were standardized by 
scaling the average of the fluorescent intensities of all 
genes on an array to constant target intensity (250) for all 
arrays used. Data Analysis was conducted using Microarray 
Suite 5.0 (Af fymetrix) following user guidelines. The signal 
intensity for each gene was calculated as the average 
intensity difference, represented by [S(PM - MM) / (number of 
probe pairs) ] , where PM and MM denote perfect-match and 
mismatch probes. 

D. Data Analysis and results 

Twelve sets of hybridizations were successful as 
evidenced by the Expression Report generated using detection 
instruments from Genome Explorations . The main parameters on 
the report included Noise, Scale factor, background, total 
probe sets, number and percentage of present and absent probe 
sets, signal intensity of housekeeping controls. The data was 
subsequently analyzed and presented using software GCOS in 
combination of other Microsoft software. Signal comparison 
between treatment pairs was analyzed. Overall data for all 
respective probes corresponding to genes and fragments of each 
different treatment including replications were compiled and 
compiled expression data such as call of the changes and 
signal log 2 ratio changes were analyzed. 
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A typical application of GeneChip technology is finding 
genes that are differentially expressed in different tissues. 
In the present application, genetic expression variations 
caused by ethylene treatment were determined for pairs of 
converter and nonconverter tobacco lines that included a 4407- 
25/4407-33 hurley variety, PBLBOl/178 burley variety, and a NL 
Madole/181 dark variety. These analyses detected only those 
genes whose expression is significantly altered due to 
biological variation. These analyses employed the Fold change 
(signal ratio) as a major criterion to identify induced genes. 
Other parameters, such as signal intensity, present /absent 
call, were also taken into consideration. 

After analyzing the data for expression differences in 
converter and nonconverter pairs of samples for approximately 
400 genes, the results based on the signal intensities showed 
that only two genes, D121-AA8 and D120-AH4, had reproducible 
induction in ethylene treated converter lines versus non- 
converter lines. To illustrate the differential expression of 
these genes, the data was represented as follows. As shown in 
Table V, the signal of a gene in a converter line, for 
example, burley tobacco variety, 4407«-33, was determined as 
ratio to the signal of a related nonconverter isogenic line, 
4407-25. Without ethylene treatment, the ratio of converter 
to nonconverter signals for all genes approached 1.00. Upon 
ethylene treatment, two genes, D121-AA8 and D120-AH4, were 
induced in converter lines relative to non-converter line as 
determined by three independent analyses using isogenic burley 
lines. These genes have very high homology to each other, 
approximately 99.8% or greater nucleic acid sequence homology. 
As depicted in Table V, their relative hybridization signals 
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in converter varieties ranged from approximately 2 to 12 fold 
higher in converter lines than the signals in their non- 
converter counterparts. In comparison, two actin-like control 
clones, internal controls, were found not to be induced in 
converter lines based on their normalized ratios. In addition, 
a fragment (D35-BG11) , whose sequence in coding region is 
entirely contained in both D121-AA8 and D120-AH4 genes, was 
highly induced in the same samples of paired isogenic 
converter and nonconverter lines. Another isogenic pair of 
burley tobacco varieties, PBLBOl and 178, was shown to have 
the same genes, D121-AA8 and D12 0-AH:4, induced in converter 
samples under ethylene induction. Furthermore, D121-AA8 and 
D120-AH4 genes were preferentially induced in converter lines 
of isogenic dark tobacco pairs, NL Madole and 181, 
demonstrating that ethylene induction of these genes in 
converter lines was not limited to burley tobacco varieties. 
In all cases, the D3 5-BG11 fragment was the most highly 
induced in converter relative to nonconverter paired lines. 
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Table V; A Compa rison of Clone Xnducfcion in Ethylene Treated. 

Convert er and Non-Converter liinRss 



Clones 


No 
Treatment 


Ethylene Treated 
Burlev Exp 1 


Ethylene Treated 
Burley Exp 2 


Ethylene Treated 
Burley Exp 3 


Ethylene Treated 
Dark 


Induced 


33:25 Ratio 


33:25 EtNo 
Ratio Ratio 


33:25 Et:No 
Ratio Ratio 


33:26 EtNo 
Ratio Ratio 


181:NL Et:No 
Ratio Ratio 


D121-AA8 
D120-AH4 


1.03 
tAA 


2.20 2.14 
2.74 1 .90 


13.25 12.90 
18.33 12.74 


5.31 5.16 
4.13 2.87 


17.06 16,60 
11.76 8.17 


Controf 












Actin-Uke 1 
Actin-Like 1 


1.18 
1.09 


1.17 0.99 
1.23 1.12 


0.88 0.74 
0.89 0.81 


0.86 0.73 
1.18 0.11 


1.20 1.02 
1.02 0.93 



EXAMPLE 15: CL ONING RELATD D35-BQ11 PULL LENGTH GENES 

GeneChip hybridization was based on 3' reverse 
transcription (cRNA) . The probes were synthesized on GeneChip 
were chosen from the 3' end of the genes (in the downstream 
1000 nucleotide region) . Therefore, in order to obtain all 
the possible variations of D121~AA8 and D12 0-AH4 clones, 
additional cloning was performed from the tobacco cDNA library 
using 5' sequences. 

The full length genes were cloned from cDNA library 
constructed from 4407-33 ethylene treated tissue as described 
in Example 5 . The Polymerase Chain Reaction method was used as 
follows. The reverse primers were designed based on the 3' 
sequence (including part of untranslated region) the D121-AA8 
gene. The primer of D121-p2 5 ' -AGC AAG ATG ATC TTA GGT TTT 
AA-3' and D121-R-2 5'~CAA GCA AGA TGA TCT TAG GTT TTA ATA AAG 
CTC AGG T-3'. The T3 primer (5'CAA TTA ACC CTC ACT AAA GGG 
3') , located in upstream of the inserts in the plasmid, was 
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used as forward primer The generated PGR products were 
subjected to agarose electrophoresis and the corresponding 
bands of high molecular weight were excised, purified, cloned 
and sequenced. The methods for cloning and sequencing were 
described in Example 4. Nine novel clones were sequenced and 
identified as D425-AB10, D425-AB11, D425-AC9, D425-AC10, D425- 
ACll, D425-AG11, D425--AH7, D425-AH11, and D427-AA5. Each of 
the clones was observed to have 99% or greater nucleic acid 
sequence homology with clones D121-AA8 and D120-AH4. 

EXAMPLE 16: ETHYLENE INDUCTION OF MICRQSQMAIi NICOTINE 
DEMETHYLASE IN TOBACCO CONVERTER LINES 

Biochemical analyses of demethylase enzymatic activity in 
microsomal enriched fractions of ethylene treated and non- 
treated pairs of converter and non-converter tobacco lines was 
performed as follows. 

A. Preparation of Microsomes 

Microsomes were isolated at 4''C. Tobacco leaves were 
extracted in a buffer consisting of 50 mM N- (2-hydrooxyethyl) 
piperazine-N' - (2-ethanesulf onic acid) (HEPES) , pH 7.5, 3 mM 
DL-Dithiothreitol (DTT) and Protease Inhibitor Cocktail 
(Roche) at 1 tablet/50 ml. The crude extract was filtered 
through four layers of cheesecloth to remove undisrupted 
tissue, and the filtrate was centrifuged for 20 min at 20,000 
X g to remove cellular debris. The supernatant was subjected 
to ultracentrifugation at 100,000 x g for 60 min and the 
resultant pellet contained the microsomal fraction. The 
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microsomal fraction was suspended in the extraction buffer and 
applied to an ultracentrif ugation step where a discontinuous 
sucrose gradient of 0-5 M sucrose in the extraction buffer was 
used. The purified microsomes were resuspended in the 
extraction buffer supplemented with 10% (w/v) glycerol as 
cryoprotectant . Microsomal preparations were stored in a 
liquid nitrogen freezer until use. 

B, Protein Concentration Determination 

Microsomal proteins were precipitated with 10% 
Trichloroacetic Acid (TCA) (w/v) in acetone, and the protein 
concentrations of microsomes were determined using RC DC 
Protein Assay Kit (BIO-RAD) following the manufacturer's 
protocol 

3 . Nicotine Demethylase Activity Assay 

DL-Nicotine ( Pyrrolidine-2-^^C) was obtained f rom Moravek 
Biochemicals and had a specific activity of 54 mCi/mmol. 
Chlorpromazine (CPZ) and oxidized cytochrome c (cyt, C), both 
P450 inhibitors, were purchased from Sigma. Reduced form of 
nicotinamide adenine dinucleotide phosphate (NADPH) is the 
typical electron donor for cytochrome P450 via the 
NADPH : cytochrome P450 reductase. NADPH was omitted for control 
incubation- Routine enzyme assay consisted of microsomal 
proteins (around 2 mg/ml) , 5 iriM NADPH, 55 ^^C labeled 

nicotine. The concentration of CPZ and Cyt. C, when used, was 
1 mM and 100 |4M, respectively. The reaction was carried at 
25°C for 1 hour and was stopped with addition of 300 \xl 
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methanol to each 25 |il reaction mixture. After spinning, 20 (il 
of the methanol extract was separated with a reverse-phase 
High Performance Liquid Chromatography (HPLC) system (Agilent) 
using an Inertsil ODS-3 3|J, {150 x 4.6 mm) column from Varian. 
The isocratic mobile phase was the mixture of methanol and 5 0 
mM potassium phosphate buffer, pH 6.25, with ratio of 60:40 
(v/v) and the flow rate was 1 ml/min. The nornicotine peak, as 
determined by comparison with authentic non-labeled 
nornicotine, was collected and subjected to 29 00 tri-carb 
Liquid Scintillation Counter (LSC) (Parkin Elmer) for 
quantification- The activity of nicotine demethylase is 
calculated based on the production of labeled nornicotine 
over 1 hour incubation. 

Samples were obtained from pairs of Burley converter 
(line 4407-33) and non-converter (line 4407-25) tobacco lines 
that were ethylene treated or not. All untreated samples did 
not have any detectable microsomal nicotine demethylase 
activity. In contrast, microsomal samples obtained from 
ethylene treated converter lines were found to contain 
significant levels of nicotine demethylase activity. The 
nicotine demethylase activity was shown to be inhibited by 
P450 specific inhibitors demonstrating the demethylase 
activity was consistent to a P450 microsomal derived enzyme. 
A typical set of enzyme assay results obtained for the burley 
converter tobacco line is shown in the Table VI. In contrast, 
sample derived from ethylene treated nonconverter tobacco did 
not contain any nicotine demethylase activity. These results 
demonstrated that nicotine demethylase activity was induced 
upon treatment with ethylene in converter lines but not in the 
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corresponding isogenic nonconverter line. Similar results 
were obtained for an isogenic dark tobacco variety pair, where 
microsomal nicotine demethylase activity was induced in 
converter lines and not detectable in nonconverter paired 
lines. Together these experiments demonstrated that 

microsomal nicotine demethylase activity is induced upon 
ethylene treatment in converter lines while not in paired 
isogenic nonconverter lines. Those genes that are P450 
derived genes and are preferentially induced in converter 
lines relative to paired non-converter lines are candidate 
genes to encode the nicotine demethylase enzyme. 



Table VI: DEMETHYIiASE ACTIVITY IN MICROSOMES OF ETHYLENE 
INDUCED BURLEY CONVERTER AND NON CONVERTER LINES 



Sample 


Microsomes 


Microsomes + 
1 mM chlor- 
promazine 


Microsomes -h 
with 100 pM 
cytochrome C 


Microsomes - 

NADPH 


Converter 


0,6 ± 0.05 
pkat / mg 


0.01 ± 0.01 
pkat / mg 


0,03 ± 0.05 
pkat / mg 


Q,03 ± 0.04 
pkat / mg 


Non- 
Converter 


Not Detected 


Not Detected 


Not Detected 


Not Detected 



Niomerous modifications and variations in practice of the 
invention are expected to occur to those skilled in the art upon 
consideration of the foregoing detailed description of the 
invention. Consequently, such modifications and variations are 
intended to be included within the scope of the following claims. 
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V9HA.T IS CIAXMMV IS: 

1. An isolated nucleic acid molecule from Nicotiana wherein 
said nucleic acid molecule comprising a nucleic acid 
sequence selected from the group consisting of SEQ. ID. No. 
299 through SEQ. ID. No. 357. 

2. A transgenic plant, wherein said transgenic plant comprises 
the nucleic acid molecule of Claim 1. 

3. The transgenic plant of Claim 24, wherein said plant is a 

tobacco plant. 

4. A method of producing a transgenic plant, wherein said 
method comprises the steps of: 

(i) operably linking said nucleic acid molecule of any one of 
the Claim 1 with a promoter functional in said plant to 
create a plant transformational vector; 

(ii) transforming said plant with said plant transformational 
vector of step; 

(iii) selecting a plant cell transformed with said transformation 
vector; and 

(iv) regenerating a transformation plant from said transformed 
plant cell. 

5. The method of Claim 4, wherein said nucleic acid molecule is 
in an antisense orientation. 

6- The method of Claim 4, wherein said nucleic acid molecule is 
in a sense orientation. 
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7. The method of Claim 4, wherein said nucleic acid molecule is 
in a RNA interference orientation, 

8. The method of Claim 4, wherein said nucleic acid molecule is 
expressed as a double stranded RNA molecule. 

9. The method of Claim A, wherein said double stranded RNA 
molecule is about 15 to 25 nucleotides in length, 

10. The method of Claim 4, wherein said transgenic plant is a 
tobacco plant. 

11. A method of selecting a plant containing a nucleic acid 
molecule, wherein said plant is analyzed for the presence of 
nucleic acid sequence selected from the group consisting of 
299 through 357 . 

12. The method of selecting a plant of Claim 11, wherein said 
plant is analyzed by DNA hybridization. 

13. The method of selecting a plant of Claim 11, wherein said 
DNA hybridization is Southern blot analysis. 

14. The method of selecting a plant of Claim 11 wherein said DNA 
hybridization is Northern blot analysis, 

15. The method of selecting a plant of Claim 11/ wherein said 
plant is analyzed by PCR detection. 

16. The method of Claim 11, wherein said plant is a tobacco 
plant . 
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17. A method of increasing or decreasing nornicotine levels in a 
plant, wherein said method comprises the steps of : 

(i) operably linking said nucleic acid molecule of claim 1 
5 with a promoter functional in said plant to create a plant 

transformational vector; 

(ii) transforming said plant with said plant 
transformational vector of step (i) ; 

(iii) selecting a plant cell transformed with said 
10 transformation vector; and 

(iv) regenerating a transformation plant from said 
transformed plant cell . 

18- The method of Claim 17, wherein said nucleic acid 
15 molecule is in an antisense orientation. 

19. The method of Claim 17, wherein said nucleic acid 
molecule is in a sense orientation. 

20 21. The method of Claim 17, wherein said nucleic acid 

molecule is in a RNA interference orientation. 

22. The method of Claim 17, wherein said nucleic acid 
molecule is expressed as a double stranded RNA molecule. 

25 

23. The method of Claim 17, wherein said transgenic plant 
is a tobacco plant. 
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SBQ XD X D58-BQ7 

1 GCACAACTT GCTATCAACT TGGTCACATC TATGTTGGGT 
61 CATTTGTTGC ATCATTTTAC ATGGGCTCCG GCCCCGGGGG TTAACCCGGA GGATATTGAC 
121 TTGGAGGAGA GCCCTGGAAC AGTAACTTAC ATGAAAAATC CAATACAAGC TATTCCAACT 
181 CCAAGATTGC CTGCACACTT GTATGGACGT GTGCCAGTGG ATATGTA& 
SBQ ID 2 

AQLAINLVTSMLGHLLHHFTWAPAPGVNPEDIDIiEESPGTVTY^IF^ 



FIG. 2 

SBQ ID 3 D58-ABX 

1 GCACAACT TGCTATCAAC TTGGTCACAT CTATGTTGGG 
61 TCATTTGTTG CATCATTTTA CGTGGGCTCC GCCCCGGGGG GTTAACCCGG AGAATATTGA 
121 CTTGGAGGAG AGCCCTGGAA CAGTAACTTA CATGAAAAAT CCAATACAAG CTATTCCTAC 
181 TCCAAGATTG CCTGCACACT TGTATGGACG TGTGCCAGTG GATATGTAA 
SBQ ID 4 

AQllAINLWS^lLGHLLHHFTWAPPPGVNPENIDLEESPGTVTy^IKNPIQAIPTPRL 



FIG. 3 

SBQ ID 5 BX86-AH4 

1 ATGAATTAT TCATTGCAAG TGGAACACCT TTCAATTGCT 
61 CATATGATCC AAGGTTTCAG TTTTGCAACT ACGACCAATG AGCCTTTGGA TATGAAACAA 
121 GGTGTGGGTT TAACTTTACC AAAGAAGACT GATGTTGAAG TGCTAATTAC ACCTCGCCTT 
181 CCTCCTACGC TTTATCAATA TTAA 
SBQ ID 6 

MNYSLQVEHLSIAHMIQGFSFATTTNEPLDMKQGVGLTLPKKT0VE\njITPRLPPTLYQy 



FIG. 4 

SBQ XD 7 D56-BB4 

1 GCACAACTT GCTATCAACT TGGTCACATC TATGTTGGGT 
61 CATTTGTTCA TCATTTTACA TGGGCTCCGG CCCCGGGGGT TAACCCGGAG GATATTGACT 
121 TGGAGGAGAG CCCTGGAACA GTAACTTACA TQX 
SBQ XD 8 

AQLAINLVTSMLGHLFI ILHGLRPRGIiTRRILTWRRALEQ 



FIO. 5 

SBQ ID 9 Dse-jur? 

1 GAAGGATTG GCTGTTCGAA TGGTTGCCTT GTCATTGGGA 
61 TGTATTATTC AATGTTTTGA TTGGCAACGA ATCGGCGAAG AATTGGTTGA TATGACTGAA 
121 GGAACTGGAC TTACTTTGCC TAAAGCTCAA CCTTTGGTGG CCAAGTGTAG CCCACGACCT 
181 AAAATGGCTA ATCTTCTCTC TCAGATTTGX 

SBQ ID 10 

EGLAVRMVALSLGCIIQCFDWQRIGEELVDhlTEGTGLTLPKAQPLVAKCSPRPKMANLLSQI 
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SBQ ID 11 01 3a- 5 

1 GAAGGATTG GCTATTCGAA TGGTTGCATT GTCATTGGGA 
61 TGTATTATTC AATGCTTTGA TTGGCAACGA CTTGGGGAAG GATTGGTTGA TAAGACTGAA 
121 GGAACTGGAC TTACTTTGCC TAAAGCTCAA CCTTTAGTGG CCAAGTGTAG CCCACGACCT 
181 ATAATGGCTA ATCTTCTTTC TCAGATTTGA 
SEQ ID 12 

EGIJ^IRWALSLGCIIQCFDWQRLGEGLVDKTEGTGLTLPKAQPLVAKCSPRPI^1A1>TLLSQI 



PIG. 7 

SEQ ID 13 D56-AG10 

1 ATAGGTTTT GCGACTTTAG TGACACATCT GACTTTTGGT 
61 CGCTTGCTTC AAGGTTTTGA TTTTAGTAAG CCATCAAACA CGCC AATTGA CATGACAGAA 
121 GGCGTAGGCG TTACTTTGCC TAAGGTTAAT CAAGTTGAAG TTCTAATTAC CCCTCGTTTA 
181 CCTTCTAAGC TTTATTTATT TTGA 
SEQ ID 14 

IGFATLVTHLTFGRLLQGFDFSKPSNTPIDMTEGVGVTLPKVNQVEVLITPRLPSKLYLF 



FIG. 8 

SEQ ID 15 D35-33 

1 ATAGGCTTT GCGACTTTAG TGACACATCT GACTTTTGGT 
61 CGCTTGCTTC AAGGTTTTGA TTTTAGTAAG CCATCAAACA CGCCAATTGA CATGACAGAA 
121 GGCGTAGGCG TTACTTTGCC TAAGGTTAAT CAAGTTGAAG TTCTAATTAC CCCTCGTTTA 
181 CCTTCTAAGC TTTATTTAT 
SEQ ID 16 

IGFATLVTHLTFGRLLQGFDFSKPSNTPIDMTEGVGVTLPKVNQVEVLITPRLPSKLYL 



FIG. 9 

SBQ ID 17 D34-62 

1 ATAAATTTT GCGACTTTAG TGACACATCT GACTTTTGGT 
61 CGCTTGCTTC AAGGTTTTGA TTTTAGTACG CCATCAAACA CGCCAATAGA CATGACAGAA 
121 GGCGTAGGCG TTACTTTGCC TAAGGTAAAT CAAGTGGAAG TTCTAATTAG CCCTCGTTTA 
181 CCTTCTAAGC TTTATGTATT CTGA 
SEQ ID 18 

INFATLVTHLTFGRLLQGFDFSTPSOTPID^^^EGVGVTLPKVWQVEVLISPR^ 



FIG. 10 

SEQ ID 19 D56AA7 

1 ATTATACTT GCATTGCCAA TTCTTGGCAT CACTTTGGGA 
61 CGTTTGGTTC AGAACTTTGA GCTGTTGCCT CCTCCAGGCC AGTCGAAGCT CGACACCACA 
121 GAGAAAGGTG GACAGTTCAG TCTCCACATT TTGAAGCATT CCACCATTGT GTTGAA.\CCA 
181 AGGTCTTTCT GA 
SEQ ID 20 

IILALPILGITLGRLVQNFELLPPPGQSKLDTTEKGGQFSLHILKHSTIVLKPRSF 
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SBQ ID 21 D56-AJB1 

1 ATTATACTT GCATTGCCAA TTCTTGGCAT TACTTTGGGA 
61 CGTTTGGTTC AGAACTTTGA GCTGTTGCCT CCTCCAGGCC AGTCGAAGCT CGACACCACA 
121 GAGAAAGGTG GACAGTTCAG TCTCCATATT TTGAAGCATT CCACCATTGT GTTGAAACCA 
181 AGGTCTTGCT GA 
SBQ ID 22 

IILALPILGITLGRLVQNFELLPPPGQSKLDTTEKGGQFSLHILKHSTIVLKPRSC 



FIG. 12 

SBQ ID 23 035-BB7 

1 TATTGCACTT GGGGTTGCAT CAATGGAACT TGCATTGTCA 
61 AATCTTCTTT ATGCATTTGA TTGGGAGTTA CCTTTTGGAA TGAAAAAAGA AGACATTGAC 
121 ACAAACGCCA GGCCTGGAAT TACCATGCAT AAGAAAAACG AACTTTATCT TATCCCTAAA 
181 AATTATCTAT AG 
SEQ ID 24 

IALGVASMELALS^rLLYAFDWELPFG^^a<:EDIDTOARPGIT^IHKKl^ 



PIG* 13 

SEQ ID 25 D177-BA7 

1 ATTGCACTTG GGGTTGCATC CATGGAACTT 

121 GCTTTGTCAA ATCTTCTTTA TGCATTTGAT TGGGAGTTAC CTTACGGAGT GAAAAAAGAA 

181 AACATTGACA CAAATGTCAG GCCTGGAATT ACCATGCATA AGAAAAACGA ACTTTGCCTT 

2 41 ATCCCTAGAA ATTATCTATA G 
SBQ ID 2€ 

IALGVASMEUU:.Sin.LYAFDWELPYGVKKENIDTNVRPGITMHKra 



FIG. 14 

SBQ ID 27 D56A-AB6 

1 GGTATTGCAC TTGGGGTTGC ATCCATGGAA CTTGCTTTGT CAAATCTTCT TTATGCATTT 

61 GATTGGGAGT TGCCTTATGG AGTGAAAAAA GAAGACATCG ACACAAACGT TAGGCCTGGA 

121 ATTGCCATGC ACAAGAAAAA CGAACTTTGC CTTGTCCCAA AAAATTATTT ATAA 
SBQ ID 28 

IALGVASMELALSNLLYAFDV^LPYGVKKEDIDTNWPGIA^mKKNELCLV^ 



FIG. 15 

SBQ ID 29 D144-AB2 

1 ATT GCACTTGGGG TTGCATCCAT GGAACTTGCT 
61 TTGTCAAATC TTCTTTATGC ATTTGATTGG GAGTTGCCTT ATGGAGTGAA AAAAGAAGAC 
121 ATCGACACAA ACGTTAGGCC TGGAATTGCC ATGCACAAGA AAAACGAACT TTGCCTTGTC 
181 CC AAAAAAAT TATTTATAAA TTATATTGGG ACGTGGATCT CATGCTAG 
SBQ ID 30 

lALGVASMELALSl^LYAFDWELPYGVKKSDIDTNVRPGI 
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SBQ ID 31 056-AGll 

1 ATTTCGTTT GGTTTAGCTA ATGCTTATTT GCCATTGGCT 
61 CAATTACTTT ATCACTTTGA TTGGGAACTC CCCACTGGAA TCAAACCAAG CGACTTGGAC 
121 TTGACTGAGT TGGTTGGAGT AACTGCCGCT AGAAAAAGTG ACCTTTACTT GGTTGCGACT 
181 CCTTATCAAC CTCCTCAAAA CTGA 
SEQ ID 32 

ISFGIJ^AYLPLAQLLVHFDWELPTGIKPSDLDLTELVGVTAARKSDL^rLVATPYQPPQN 



FIG. 17 

SEQ ID 33 0179-AJa 

1 ATTTCGTTT GGCTTAGCTA ATGCTTATTT GCCATTGGCT 
61 CAATTACTAT ATCACTTCGA TTGGAAACTC CCTGCTGGAA TCGAACCAAG CGACTTGGAC 
121 TTGACTGAGT TGGTTGGAGT AACTGCCGCT AGAAAAAGTG ACCTTTACTT GGTTGCGACT 
181 CCTTATCAAC CTCCTCAAAA GTGA 
SEQ ID 34 

ISFGLANAYLPLAQLLYHFDWKLPAGIEPSDLDLTELVGVTAARKSDLYLVATPYQPPQK 



FIG. 18 

SBQ ID 35 D56-AC7 

1 ATGCTATTT GGTTTAGCTA ATGTTGGACA ACCTTTAGCT 
61 CAGTTACTTT ATCACTTCGA TTGGAAJ^CTC CCTAATGGAC AAAGTCATGA GAATTTCGAC 
121 ATGACTGAGT CACCTGGAAT TTCTGCTACA AGAAAGGATG ATCTTGTTTT GATTGCCACT 
181 CCTTATGATT CTTATTAATTCCAGTCTA TATCATCTAT ATGTACTCAA TAATTGTATG 
361 GGA 
SEQ ID 36 

^ILFGIJa^GQPLAQLLyHFDWKLPNGQSHENFDMTESPGISATRKDDLVLIATPyDSY 



FIG. 19 

SBQ ID 37 D144-AD1 

1 ATGC TATTTGGTTT AGCTAATGTT 
61 GGACAACCTT TAGCTCAGTT ACTTTATCAC TTCGATTGGA AACTCCCTAA TGGACAAACT 
121 CACCAAAATT TCGACATGAC TGAGTCACCT GGAATTTCTG CTACAAGAAA GGATGATCTT 
181 ATTTTGATTG CCACTCCTGC TCATTCTTGA 
SEQ ID 38 

MLFGIxAWGQPIJ^QLLYnHFDWKLPNGQTHQrTFDMTESPGISATRKDDLILIATPA^ 



FIG. 20 

SBQ ID 39 D144-AB5 

1 TTAT TATTCGGTTT AGTTAATGTA 
61 GGACATCCTT TAGCTCAATT GCTTTATCAC TTCGATTGGA AGACTCTTCC TGGGATAAGT 
121 TCAGATAGTT TCGACATGAC TGAAACAGAT GGAGTAACTG CCGGAAGAAA GGATGATCTT 
181 TGTTTAATTG CTACTCCTTT TGGTCTCAAT TAA 
SBQ ID 40 

LLFGLVWGHPLAQLLVHFDWKTLPGISSDSFDMTETDGVTAGRKDDLCLIATPFGLN 
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FX6. 21 

SBQ ID 41 D181-AB5 

1 A TGTCGTTTGG TTTAGTTAAC ACTGGGCATC CTTTAGCTCA 
61 GTTGCTCTAT TTCTTTGACT GGAAATTCCC TCATAAGGTT AATGCAGCTG ATTTTCACAC 
121 TACTGAAACA AGTAGAGTTT TTGCAGCAAG CAAAGATGAC CTCTACTTGA TTCCAACAAA 
181 TCACATGGAG CAAGAGTAG 
SEQ ID 42 

msfglvotghpiaqllyffdwkfphkwaadfhttetsrvfaaskddlyliptn™ 



FIG. 22 

SBQ ID 43 D73-AC0 

1 AT gtcgtttggt ttagttaaca cagggcatcc tttagcccag 
121 ttgctctatt gctttgactg gaaactccct gacaaggtta atgcaaatga ttttcgcact 
181 actgaaacaa gtagagtttt tgcagcaagc aaagatgacc tctacttgat tcccacaaat 
241 cacagggagc aagaatag 

SEQ ID 44 

MSFGLVNTGHPLAQLLYCFDWKLPDKWANDFRTTETSRVFAASKDDLYLIPTN^ 



FIG. 23 

SEQ ID 45 D56-AC12 

1 ATGCAATTT GGTTTGGCTC TTGTTACTCT GCCATTGGCT 
61 CATTTGCTTC ACAATTTTGA TTGGAAACTT CCCGAAGGAA TTAATGCAAG GGATTTGGAC 
121 ATGACAGAGG CAAATGGGAT ATCTGCTAGA AGAGAAAAAG ATCTTTACTT GATTGCTACT 
181 CCTTATGTAT CACCTCTTGA TTAA 
SEQ ID 46 

MQFGLALVTLPZJUrLLHNFDWTKLPEGINAI^ 



FIG. 24 

SBQ ID 47 D58-AB9 

1 ATGACTTAT GCATTGCAAG TGGAACACCT AACAATGGCA 
61 CATTTGATCC AGGGTTTCAA TTACAGAACT CCAACTGATG AGCCCTTGGA TATGAAAGAA 
121 GGTGCAGGCA TAACTATACG TAAGGTAAAT CCTGTGAAAG TGATAATTAC GCCTCGCTTG 
181 GCACCTGAGC TTTATTAA 
SBQ ID 48 

MTYALQVEHLT^I^UiLIQGFmrRTPTDEPLDMKEGAGITIRKVNPVKVI ITPRLAPELY 



FIG. 25 

SBQ ID 49 D56~AG9 

1 ATGACTTAT GCATTGCAAG TGGAACACCT AACAATGGCA 
61 CATTTAATCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGGCCTTGGA TATGAAGGAA 
121 GGTGCAGGCA TAACTATACG TAAGGTAAAT CCTGTGGAAC TGATAATAGC GCCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SBQ ID 50 

MTYALQVEHLTMAHLIQGFNYKTPNDEAI*DMKEGAGITIRK\mPVELIIAPRIiAPELY 
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SBQ ID 51 V56':kG6 

1 ATGACTTAT GCATTGCAAG TG<3AACACCT AACAATGGCA 
61 CATTTAATCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGGCCTTGGA TATGAAGGAA 
121 GGTGCAGGCA TAACAATACG TAAGGTAAAT CCAGTGGAAT TGATAATAAC GCCTCGCTTG 
181 GCACCTGAGC TTTACTAA 
SEQ ID 52 

MTYALQVEHLT^lAHLIQGFm'KTPm)EAl>D^IKEGAGITIRKVNPV^LIITPR^ 



FIG. 27 

SEQ ID 53 D35-BG11 

1 ATGACTTAT GCATTGCAAG TGGAACACTT AACAATGGCA 
61 CATTTGATCC AAGGTTTCAA TTACAGAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGTGCAGGCA TAACTATACG TAAGGTAAAT CCTGTGGAAC TGATAATAGC GCCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SEQ ID 54 

MTYAL.QVEHLT^lAHLIQGFm'RTP^TDEPLD^IKEGAGITIRKVNPVELIIAPR^ 



FIG. 28 

SEQ ID 55 D35-42 

1 ATGACTTAT GCATTGCAAG TGGAACACTT AACAATGGCA 
61 CATTTGATCC AAGGTTTCAA TTACAGAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGTGCAGGCA TAACTATACG TAAGGTAAAT CCTGTGGAAC TGATAATAGC GCCCCTGGCA 
181 CCTGAGCTTT ATTAA 
SEQ ID 56 

MTYALQVEHLTaAHLIQGFm^RTPr^EPLDMKEGAGITIRK^PVELIIAPLAPELY 



FIG. 29 

SEQ ID 57 D3 5~BA3 

1 ATGACTTAT GCATTGCAAG TGGAACACTT AACAATGGCA 
61 CATTTGATCC AAGGTTTCAA TTACAGAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGTGCAGGCA TAACTATACG TAAGGTAAAT CCTGCGGAAC TGATAATAGC GCCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SEQ ID 58 

MTYALQVEHLTMAHLIQGFInTYRTPNDEPLDMKJEGAGITIRKVNP^^ 



FIG. 30 

SEQ ID 59 D34-57 

1 ATGACTTAT GCATTACAAG TGGAACACCT AACAATAGCA 
61 CATTTGATCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGTGCAGGAT TAACCATACG TAAAGTAAAT CCTGTAGAAG TGACAACTAC GCCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SEQ ID 60 

MTYALQVEHLTIAHLIQGF^nfKTPNDEPLDMKEGAGLTIRKVNPVEVTTTAR^ 
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aSQ ID 6X D34-52 

1 ATGACTTAT GCATTACAAG TGGAACACCT AACAATAGCA 
61 CATTTGATCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGTGCAGGAT TAACTATACG TAAAGTAAAT CCTGTAGAAG TGACAATTAC GGCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SBQ ID 62 

MT YALQVEHLT I AHLI QGFbrn<TPr^E PLD^^<EG AGLTI RKVNPVEVT I TAR^ 



FIG. 32 

SBQ ID 63 D34-25 

1 ATGACTTAT GCATTACAAG TGGAACACCT AACAATAGCA 
61 CATTTGATCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGCCCCTGGA TATGAAGGAA 
121 GGTGCAGGAT TAACTATACG TAAAGTAAAT CCTGTAGAAG TGACAATTAC GGCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SBQ ID 64 

MTYALQVEHLTIAHLIQGFm^KTPbmEPLDMKEGAGLTIRKVNPVEVTITARL^ 



PIG. 33 

SEQ ID 65 D56AD10 

1 TATAGCCTT GGACTTAAGG TTATCCGAGT AACATTAGCC 

61 AACATGTTGC ATGGATTCAA CTGGAAATTA CCTGAAGGTA TGAAGCCAGA AGATATAAGT 

121 GTGGAAGAAC ATTATGGGCT CACTACACAT CCTAAGTTTC CTGTTCCTGT GATCTTGGAA 

181 TCTAGACTTT CTTCAGATCT CTATTCCCCC ATCACTTAA 
SEQ ID 66 

YSLGLKVIRWLAJm.HGFNVi^KXPEG^^<PEDISVEEHYGLTT^ 



FIG. 34 



SEQ ID 67 D56-AA11 

1 ATACAGTCTT GGGATTCGTA TAATTAGGGC AACTTTAGCT 

61 AACTTGTTGC ATGGATTCAA CTGGAGATTG CCTAATGGTA TGAGTCCAGA AGACATTAGC 

121 ATGGAAGAGA TTTATGGGCT AATTACACAC CCCAAAGTCG CACTTGACGT GATGATGGAG 

181 CCTCGACTTC CCAACCATCT TTACAAATAG 

SBQ ID 68 

YSLGIRIIRATIANLLHGFNWRLPNGMSPEDISMEEIYGLITHPK^ 



FIG. 35 

SEQ ID 69 D177«-BD5 

1 ATTAATTTTT CAATACCACT TGTTGAGCTT 
121 GCACTTGCTA ATCTATTGTT TCATTATAAT TGGTCACTTC CTGAAGGGAT GCTAGCTAAG 
181 GATGTTGATA TGGAAGAAGC TTTGGGGATT ACCATGCAtA AGAAATCTCC CCTTTGCTTA 
241 GTAGCTTCTC ATTATACTTG TTGA 
SBQ ID 70 

INFSI PLVELAIANLLFHYlMSLPEGMLiAia^ 
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SEQ ID 71 D56A-AG10 

1 ATGCAACTTG GGCTTTATGC ATTGGAAATG GCTGTGGCCC ATCTTCTTCA TTGTTTTACT 
61 TGGGAATTGC CAGATGGTAT GAAACCAAGT GAGCTTAAAA TGGATGATAT TTTTGGACTC 
121 ACTGCTCCAA AAGCTAATCG ACTCGTGGCT GTGCCTACTC CACGTTTGTT GTGTCCCCTT 
181 TATTAATTOA 
SEQ ID 72 

MQLGLYALEU*lAVAHLLHCFTWELPDGIdKPSELKMDDIFGLTAPKA^JRLVAVPTPRLLCPLY 



PIG. 37 

SEQ ID 73 58*BC5 

1 ATGCAACTT GGGCTTTATG CATTAGAAAT GGCAGTGGCC 
61 CATCTTCTTC TTTGCTTTAC TTGGGAATTG CCAGATGGTA TGAAACCAAG TGAGCTTAAA 
121 ATGGATGATA TTTTTGGACT CACTGCTCCA AGAGCTAATC GACTCGTGGC TGTGCCTAGT 
181 CCACGTTTGT TGTGCCCACT TTATTAA 
SEQ ID 74 

MQLGLYALEMAVAHLLLCFTWELPDGMKPSELKMDDIFGLTAPRA]^^^V^^ 



FIG. 38 

SEQ ID 75 D58-AD12 

1 ATGCAACTT GGGCTTTATG CATTGGAAAT GGCTGTGGCC 

61 CATCTTCTTC ATTGTTTTAC TTGGGAATTG CCAGATGGTA TGAAACCAAG TGAGCTTAAA 

121 ATGGATGATA TTTTTGGACT CACTGCTCCA AGAGCTAATC GACTCGTGGC TGTGCCTAGT 

181 CCACGTTTGT TGTGTCCCCT TTATTAA 
SEQ ID 76 

MQLGLYALEMAVAHLLHCFTVJELPDG^1KPSELK14DDIFGLTAPRAN1U.VAVPTPRLLCPLY 



PIG. 39 

SEQ ID 77 D56-AC11 

1 ATGCTTTGG AGTGCGAGTA TAGTGCGCGT CAGCTACCTA 
61 ACTTGTATTT ATAGATTCCA AGTATATGCT GGGTCTGTGT TCAGAGTAGC ATGA 
SEQ ID 78 

MLWSAS I VRVS YLTCI YRFQVYAGSVFRVA 



PIG. 40 

SEQ ID 79 D35-39 

1 ATGCTTTGG AGTGCGAGTA TAGTGCGCGT CAGCTACCTA 
61 ACTTGTATTT ATAGATTCCA AGTATATGCT GGGTCTGTGT TCAGAGTAGC ATGA 
SBQ ID 80 

MLWSAS I VRVS YLTC I YRFQVYAGSVFRVA 
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SEQ ID 81 ]>5d-Bn4 

1 ATGCTTTGG AGTGCGAGTA TAGTGCGCGT CAGCTACCTA 
61 ACCTGTATTT ATAGATTCCA AGTATATGCT GGGTCTGTGT TCAGAGTAGC ATGX 
SBQ ID 82 

MLWSAS I VRVSYLTC I YRFQVYAGSVFRVA 



FIG. 42 

SEQ ID 83 0177 -BD7 

1 ATTAATTTTT CAATACCACT TGTTGAGCTT GCACTTGCTA ATCTATTGTT TCATTATAAT 
61 TGGTCACTTC CTGAGGGGAT GCTACCTAAG GATGTTGATA TGGAAGAAGC TTTGGGGATT 
121 ACCATGCACA AGAAATCTCC CCTTTGCTTA GTAGCTTCTC ATTATAACTT GTTGTGA 
SBQ ID 84 

INFSIPLVELAIJ^LFHYNWSLPEGMLPKDVDblEEALGIT^ 



FIG* 43 
SEQ ID 85 

D176-BF2 

1 AT ATCATTTGGT TTGGCTAATG TTTATTTGCC ACTAGCTCAA 
121 TTGTTATATC ATTTTGATTG GAAACTCCCT ACTGGAATCA ATTCAAGTGA CTTGGACATG 
181 ACTGAGTCGT CAGGAGTAAC TTGTGCTAGA AAGAGTGATT TATACTTGAC TGCTACTCCA 
241 TATCAACTTT CTCAAGAGTG A 
SEQ ID 86 

GISFGLAWYLPIlAQLLYHFDV\n<l>PTGINSSDLDMTESSGVTCARKSDLYLTATPYQLSQE 



FIG. 44 

SEQ ID 87 I>5 6 -AS>6 

1 ATGCTTTGG AGTGCGAGTA TAGTGCGCGT CAGCTACCTA 
61 ACTTGTATTT ATAGATTCCA AGTATATGCT GGGTCTGTGT CCAGAGTAGC ATOA 
SEQ ID 88 

MLWSASIVRVSYLTCIYRFQVYAGSVSRVA 



FIG* 45 

SEQ ID 89 D73A-AD6 

1 CT GAATTTTGCA ATGTTAGAGG CAAAAATGGC ACTTGCATTG 
121 ATTCTACAAC ACTATGCTTT TGAGCTCTCT CCATCTTATG CACATGCTCC TCATACAATT 
181 ATCACTCTGC AACCTCAACA TGGTGCTCCT TTGATTTTGC GCAAGCTGTA G 
SEQ ID dO 

L^^FA^^.EAK1^^ALALILQHYAFELSPSYAHAPHTIITLQPQHGAPLILR^ 



9/141 



wo 2005/038033 



PCT/US2004/034065 



SBQ ID 91 D70A-BA11 

1 CT GAATTTTGCA ATGTTAGAGG CAAAAATGGC ACTTGCATTG 
121 ATTCTACAAC ACTATGCTTT TGAGCTCTCT CCATCTTATG CACACGCTCC TCATACAATT 
181 ATCACTCTGC AACCTCAACA TGGTGCTCCT TTGATTTTGC GCAAGCTGTX O 
SEQ ID 92 

U^JFA^ILEAK^1AIJ^LILQHYAFELSPSYAHAPHTIITLQPQHGAPLILI^ 



FIG. 47 

SBQ ID 93 D70A-BB5 

1 AA TAATTTTGCA ATGTTGGAAA CTAAGATTGC CTTAGCAATG 
121 ATCCTACAGC GTTTTGCTTT CGAGCTTTCT CCATCTTACG CTCATGCACC TACTTATGTC 
181 GTCACTCTTC GACCTCAGTG TGGTGCTCAC TTAATCTTGC AAAAATTATA GGTCCTTAAT 
241 CTGGATTTCC CATTATTGAG TAGTGCCTAA TAAATCTTCT CTATCACTAT TTTTCCATCT 
301 TTCA 
SBQ IP 94 

NNFAl^ETKIALAMILQRFAFELSPSYAHAPTYVVTLRPQCGAHI.ILQKIi 



FIG. 48 

SBQ ID 95 D7 0A-AB5 

1 AGCGAAGGGG TGGCAAAGGC AACAAAGGGG AAAATGACAT 
61 CCGCGAAAAT GCATTGGGCA AAACTTCGCG ATTTTGGAAG 
121 ATTCTACAAC GCTTCTCCTT CGAGCTCTCC CCATCTTATA 
181 GTCACTTTGA AACCCAAATA TGGTGCTCCC CTAATAATGC 
241 AATATGCTAT CCGAGGAATT CAGTTCCT 
SEQ ID 96 

QNFAILEAKMAIAMILQRFSFELSPSYTHSPYTWTLKPKYGAPLIMHRL 



FIG. 49 

SEQ ID 97 D70A-AAd 

1 AGCGAAGGGG TGGCAAAGGC AACAAAGGGG AAAATGACAT ATTTTCCATT TGGTGCAGGA 
61 CCGCGAAAAT GCATTQGOCA AAACTTCGCG ATTTTGGAAG CAAAAATGGC TATAGCTATG 
121 ATTCTACAAC GCTTCTCCTT CGAGCTCTCT CCATCTTATA CACACTCTCC ATACACTGTG 
181 GTCACTTTGA AACCCAAATA TGGTGCTCCC CTAATAATGC ACAGGCTGTA GTCCTGT 
SEQ ID 98 

QNFArLEAKMAIAMILQRFSFELSPSYTHSPYTWTLKPKYGAPLIMHRL 



FIG. SO 

SBQ ID 99 D70A-ABd 

1 C AAAATTTTGC CATGTTAGAA GCAAAGATGG CTCTGTCTAT GATCCTGCAA 
121 CGCTTCTCTT TTGAACTGTC TCCGTCTTAT GCACATGCCC CTCAGTCCAT ATTAACCGT 
181 CAGCCACAAT ATGGTGCTCC ACTTATTTTC CACAAGCTAT AA 
SBQ ID 100 

Q^JFA^ILEAK^1ALSMILQRFSFELSPSYAHAPQSILTVQPQYGAPLIFHKL 



ATTTTCCATT 
CAAAAATGGC 
CACACTCTCC 
ACAGGCTGTA 



TGGTGCAGGA 
TATAGCTATG 
ATACACTGTG 
GTCCTGTGAG 
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SBQ ID 101 D70A-BH2 

1 AT AAACTTTGCA ATGACAGAAG CGAAGATGGC TATGGCTATG 
121 ATTCTGCAAC GCTTCTCCTT TGAGCTATCT CCATCTTACA CACATGCTCC ACAGTCTGTA 
181 ATAACTATGC AACCCCAATA TGGTGCTCCT CTTATATTGC ACAAATTGTX A 
SBQ ID 102 

IlTFAMTEAKMAMAMILQRFSFELSPSYTHAPQSVITMQPQYGAPLILHiCL 



FIG. 52 

SEQ ID 103 P70A-AA4 

1 AT AAACTTTGCA ATGGCAGAAG CGAAGATGGC TATGGCTATG 

121 ATTCTGCAAC GCTTCTCCTT TGAGCTATCT CCATCTTACA CACATGCTCC ACAGTCTGTA 

181 ATAACTATGC AACCCCAATA TGGTGCTCCT CTTATATTGC ACAAATTGTA A 
SBQ ID 104 

I^IFA^lAEAK^IA^lA^IILQRFSFELSPSYTHAPQSVITMQPQyGAPLILHKL 



PIG. 53 

SEQ ID 105 D70A-BA1 

1 CA AAACTTTGCA ATGATGGAAG CAAAAATGGC AGTAGCTATG 

121 ATACTACAAA AATTTTCCTT TGAACTATCC CCTTCTTATA CACATGCTCC ATTTGCAATT 

181 GTGACTATTC ATCCTCAGTA TGGTGCTCCT CTGCTTATGC GCAGACTTTA A 
SEQ ID 106 

QNFAMl-IEAKMAVAMILQKFSFELSPSYTHAPFAIVTIHPQYGAPLLMRRL 



FIG. 54 

SBQ ID 107 D70A-BA9 

1 CA AAACTTTGCA ATGATGGAAG CA-A^AATGGC AGTAGCTATG 

121 ATACTACATA AATTTTCCTT TGAACTATCC CCTTCTTATA CACATGCTCC ATTTGCAATT 

181 GTGACTATTC ATCCTCAGTA TGGTGCTCCT CTGCTTATGC GCAGACTTTA A 
SEQ ID 108 

Ql^AhmEAKMAVAMILHKFSFELPSYTHAPFAIVTIHPQYGAPLLMRRL 



FIG. 55 

SEQ ID 109 D70A-'BD4 

1 CA AAATTTTGCT ATGTTAGAGG CTAAAATGGC AATGGCTATG 
121 ATTCTGAAAA CCTATGCATT TGAACTCTCT CCATCTTATG CTCATGCTCC TCATCCACTA 
181 CTACTTCAAC CTCAATATGG TGCTCAATTA ATTTTGTACA AGTTGTAO 
SEQ ID 110 

QNFAMLEAKMAMAMILKTYAFELSPSYAHAPHPLLLQPQYGAQLILYKL 
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SEQ ID 111 D181-AC5 

1 TATAGCATGG GGCTCAAGGC GATTCAAGCT AGCTTAGCTA 
61 ATCTTCTACA TGGATTTAAC TGGTCATTGC CTGATAATAT GACTCCTGAG GACCTCAACA 
121 TGGATGAGAT TTTTGGGCTC TCTACACCTA AAAAATTTCC ACTTGCTACT GTGATTGAGC 
181 CAAGACTTTC ACCAAAACTT TACTCTGTTT GA 
S£Q ID 112 

YSMGLKAIQASLANLLHGFITWSLPDNMTPEDLNMDEIFGLSTPKKFPLAT^ 



FIG. 57 

SEQ ID 113 D144-AH1 

1 TAT AGCTTGGGGC TCAAGGAGAT TCAAGCTAGC 
61 TTAGCTAATC TTCTACATGG ATTTAACTGG TCATTGCCTG ATAATATGAC TCCTGAGGAC 
121 CTCAACATGG ATGAGATTTT TGGGCTCTCT ACACCTAAAA AATTTCCACT TGCTACTGTG 
181 ATTGAGCCAA GACTTTCACC AAAACTTTAC TCTGTTTGA 
SEQ ID 114 

YSLGLKEIQASLAlU>LHGFl^SLPD^mTPEDLNlv^DEIFGLSTPKKFPLATVIEPRL^ 



FIG. 58 

SEQ ID 115 D34-65 

1 CATAGCTTG GGGCTCAAGG TGATTCAAGC TAGCTTAGCT 
61 AATCTTCTAC ATGGATTTAA CTGGTCATTG CCTGATAATA TGACTCCTGA GGACCTCAAC 
121 ATGGATGAGA TTTTTGGGCT CTCTACACCT AAAAAATTTC CACTTGCTAC TGTGATTGAG 
181 CCAAGACTTT CACCAAAACT TTACTCTGTT TGA 
SEQ ID 116 

HSLGLKVIQASLANLLHGF^mSLPD^mTPEDIJ^n^EIFGLSTPKKFPLATVIE 



FIG. 59 

SEQ ID 117 D35-BG2 

1 CTGTGCTTT CCATGTTTAA TCTCTAGTTA TATACTGGCT 
61 TTGAATGTGA ATCTGTATCA TAATTTCTTG CAAATTTCTC CTTCCATTTC TTATTAA 
SEQ ID 118 

LCF PCL I S S Y I LAJ.NW1.YHNFLQ I S PS I S Y 



FIG. 60 

SEQ ID 119 D7 3A-AH7 

1 TCTG GACTTGCTCA ATGTGTGGTT GGTTTAGCTT TAGCAACTCT AGTGCAGTGT 
121 TTTGAGTGGA AAAGGGTAAG CGAAGAGGTG GTTGATTTGA CGGAAGGAAA AGGTCTCACT 
181 ATGCCAAAAC CCGAGCCACT CATGGCTAGG TGCGAAGCTC GTGACATTTT TCACAAAGTT 
2 41 CTTTCAGAAA TATCTTAA 
SEQ ID 120 

SGLAQCn/VGLALATLVQCFEWKRVSEEVVDLTEGKGLTMPKPEPLb 
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SBQ ID 121 058-* AAl 

1 TTGGGCTTG GCAACGGTGC ATGTGAATTT GATGTTGGCC 
61 CGAATGATTC AAGAATTTGA ATGGTCCGCT TACCCGGAAA ATAGGAAAGT GGATTTTACT 
121 GAGAAATTGG AATTTACTGT GGTGATGAAA AATCCTTTAA GAGCTAAGGT CAAGCCAAGA 
181 ATGCAAGTGG TGTAA 
SBQ ID 122 

L G LA TVHVl^ ^ILiARM I Q E F EW S A Y P ENRKVD F 



FIG* 62 

SEQ ID 123 D73A-AE10 

1 TATGCTT TGGCTATGCT TCATTTAGAG 

121 TACTTTGTGG CTAATTTGGT TTGGCATTTT CGATGGGAGG CTGTGGAGGG AGATGATGTT 
181 GATCTTTCAG AAAAGCTAGA ATTCACCGTT GTGATGAAGA ATCCACTTCG AGCTCGTATC 
241 TGCCCCAGAG TTAACTCTAT TTGA 
SEQ ID 124 

YAIJ^a^HLEYFVANLVWHFRWEAVEGDDVDLSEKLEFTVX^^ 



PIG. 63 

SEQ ID 125 D56A-AC12 

1 GGTCAGCAAG TTGGACTTCT TAGAACAACC ATTTTCATCG CCTCATTACT GTCTGAATAT 
61 AAGCTGAAAC CTCGCTCACA CCAGAAACAA GTTGAACTCA CCGATTTAAA TCCAGCAAGT 
121 TGGCTTCATT CGATAAAAGG CGAACTGTTA GTCGATGCGA TTCCTCGAAA GAAGGCGGCA 
181 TTTTAA 
SBQ ID 126 

GQQVGLLRTTIFIASLLSEYKLKPRSHQKQVELTDLNPASVn^HSIKGELLVDAIPRKKAAF 



FIG. 64 

SBQ ID 127 D177-BF7 

1 ATCACATTTG CTAAGTTTGT GAATGAGCTA 
121 GCATTGGCAA GATTAATGTT CCATTTTGAT TTCTCGCTAC CAAAAGGAGT TAAGCATGAG 
181 GATTTGGACG TGGAGGAAGC TGCTGGAATT ACTGTTAGAA GGAAGTTCCC CCTTTTAGCC 
241 GTCGCCACTC CATGCTCGTG A 
SEQ ID 128 

ITFAKFVNELALARLMFHFDFSLPKGVKHEDLDVEEAAGITVRRKF 



FIG. 65 

SBQ ID 129 D73A-AG3 

1 CA GAGGTATGCT ATAAACCATT TGATGCTCTT TATTGCGTTG 
121 TTCACGGCTC TGATTGATTT CAAGAGGCAC AAAACGGACG GCTGTGATGA TATCGCGTAT 
181 ATTCCAACCA TTGCTCCAAA GGATGATTGT AAAGTGTTCC TTTCACAGAG GTGCACTCGA 
241 TTCCCATCTT TTTCATGA 
SBQ ID 130 

QRYAINHL^^LFIALFTALIDFKRHKTPGCDDIAYIPTIAPKDDCKVFLSQRCTRFPSFS 
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SBQ ID 131 D70A-AA12 

1 ATG TCATTTGGTT TAGCTAATCT TTACTTACCA TTGGCTCAAT 
121 TACTCTATCA CTTTGACTGG AAACTCCCAA CCGGAATCAA GCCAAGAGAC TTGGACTTGA 
181 CCGAATTATC GGG AATAACT ATTGCTAGAA AGGGTGACCT TTACTTAAAT GCTACTCCTT 
241 ATCAACCTTC TCGAGAGTAA 
SEQ ID 132 

^ISFGLA1TLYLPLAQLLYHFDV^^<XPTGIKPRDLDLTELSGITIARKGDLYLN^ 



PIG. 67 

SEQ ID 133 D185-BC1 

1 TTGGGCTTG GCAACGGTGC ATGTGAATTT GATGTTGGCC 
61 CGAACGATTC AAGAATTTGA ATGGTCCGCT TACCCGGAAA ATAGGAAAGT GGATTTtACT 
121 GAGAAATTGG AATTTACTGT GGTGATGAAA AACCCTTTAA GAGCTAAGGT CAAGCCAAGA 
181 ATGCAAGTGG TGTAA 
SEQ ID 134 

LGLATVTWOTiMLARTIQEFEWSAYPENRKVD^ 



PIG. 68 

SBQ ID 135 D185-BG2 

1 TTGGGCTTG GCAACGGTGC ATGTGAATTT GATGTTGGCC 

61 CGAATGATTC AAGAATTTGA ATGGTCCGCT TACCCGGAAA ATAGGAAAGT GGATTTACTG 
121 AGAAATTGGA ATTTACTGTG GTGA 
SEQ ID 136 

LGLATVHV^^:i^IIJUmI QEFEWS AY P EinuCVDLLRl^^ 



PIG. 69 

SEQ ID 137 D185-BE1 

1 ATCACATTT GCTAAGTTTG TGAATGAGCT AGCATTGGCA 
61 AGATTAATGT TCCATTTTGA TTTCTCGCTA CCAAAAGGAG TTAAGCATGA GGATTTGGAC 
121 GTGGAGGAAG CTGCTGGAAT TACTGTTAGG AGGAAGTTCC CCCTTTTAGC CGTCGCCACT 
181 CCATGCTCGT GA 
SEQ ID 138 

ITFAKFV^TELALARLMFHFDFSLPKGVKHEDLDVEEAAGITVRRKFPLI^ 



PIG. 70 

SEQ ID 139 D185-BD2 

1 ATCACATTT GCTAAGTTTG TGAATGAGCT AGCATTGGCA 
61 AGATTAATGT TCCATTTTGA TTTCTCGCTA CCAAAAGGAG TTAAGCATGC GGATTTGGAC 
121 GTGGAGGAAG CTGCTGGAAT TACTGTTAGA AGGAAGTTCC CCCTTTTAGC CGTCGCCACT 
181 CCATGCTCGT GA 
SEQ ID 140 

I TFAKFVNEL?U^AJUiMFHFDFSLPKGVKHADLD\^ 
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SEQ ID 141 D176-BG2 

1 CA AAATTTTGCC ATGTTAGAAG CAAAGACTAC rTTGGCTATG 
121 ATCCTACAAC GCTTCTCCTT TGAACTGTCT CCATCTTATG CACATGCTCC TCAGTCCATA 
181 ATAACTTTGC AACCCCAGTA TGGTGCTCCA CTTATTTTGC ATAAAATATA 6 
SEQ ID 142 

QNFAMLEAKTTLAMILQRFSFELSPSYAHAPQSIITLQPQYGAPLILHKI 



FIG. 72 

SBQ ID 143 Dld5-BD3 

1 ATTATCCTT GCACTGCCAA TTCTTGGCAT TACCTTGGGA 
61 CGCTTGGTGC AGAACTTTGA GTTGTTGCCT CCTCCAGGAC AGTCAAAGCT TGACACAACA 
121 GAGAAAGGCG GGCAATTCAG TCTGCACATT TTGAAGCATT CCACCATTGT GATGAAACCA 
181 AGATCTTTTT AA 
SBQ ID 144 

IILALPILGITLGRLVQ^TFELLPPPGQSKLDTTEKGGQFSLHILKHSTIVMKPRSF 



FIG. 73 

SHQ ID 145 D176-BC3 

1 C AAAATTTTGC CATGTTAGAA GCAAAGACTA CTTTGGCTAT 

121 GATCCTACAA CGCTTCTCCT TTGAACTGTC TCCATCTTAT GCACATGCTC CTCAGTCCAT 

181 AATAACTTGO AACCCCAGTA TGGTGCTCCA CTTATTTTGC ATAAAATATA GTTTATTACT 

241 TGTAAGTAGT GTCTCGTTTT ATGTTAAGCA TGAGTCCAAA ATGTTAAGGC TTGTAGAACT 

3 01 GCAAAATGGG AATGCATTTG CACTCGTGCA CTGTAGATTG TTGTAA 
SHQ ID 146 

QlSTFAMLEAKTTIJUdlLQRFSFELSPSYAHAPQSlITCNPSMVLH^ 
FALVHCRLL 



PIG. 74 

SBQ ID 147 D176-BB3 

1 GCTGAT 

61 ATGGGGTTGC GAGCAGTTTC TTTGGCATTA GGTGCACTTA TTCAATGCTT TGACTGGCAA 
121 ATTGAGGAAG CGGAAAGCTT GGAGGAAAGC TATAATTCTA GAATGACTAT GCAGAACAAG 
181 CCTTTGAAGG TTGTCTGCAC TCCACGCGAA GATCTTGGCC AGCTTCTATC CCAACTCTAA 
SBQ ID 148 

ADMGLRAVSLALGALIQCFDWQIEEAESLEESYNSF^TMQNKPLKWCTPR^^ 
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NAME D89-AB1 

ORGANISM NI COT I ANA TABACUM 

SEQ. ID. NO, 149 

1 CTTCCTTCCT AAGTCCTAAC TAAAAATGGA GATTCAGTTT TCTAACTTAG TTGCATTCTT 
61 GCTCTTTCTC TCCAGCATCT TTCTTCTATT CAAAAAATGG AAAACCAGAA AACTAAATTT 
121 GCCTCCTGGT CCATGGAAAT TACCTTTTAT TGGAAGTTTA CACCATTTGG CTGTGGCAGG 
181 TCCACTTCCT CACCATGGCC TAAAAAATTT AGCCAAACGC TATGGTCCTC TTATGCATTT 
241 ACAACTTGGA CAAATTCCTA CACTCATCAT ATCATCACCT CAAATGGCAA AAGAAGTACT 
3 01 .A.AAAACTCAC GACCTCGCTT TTGCCACTAG ACCAAAGCTT GTCGCGGCCG ACATCATTCA 
3 61 CTACGACAGC ACGGACATAG CATTTTCTCC GTACGGTGAA TACTGGAGAC AAATTCGTAA 
421 AATTTGCATA TTGGAACTCT TGAGTGCCAA GATGGTCAAA TTTTTTAGCT CGATTCGCCA 
481 AGATGAGCTC TCGAAaATGC TCTCATCTAT ACGAACGACA CCCAATCTTA CAGTCAATCT 
541 TACTGACAAA ATTTTTTGGT TTACGAGTTC GGTAACTTGT AGATCAGCTT TAGGGAAGAT 
601 ATGTGGTGAC CAAGACAAAT TGATCATTTT TATGAGGGAA ATAATATCAT TGGCAGGTGG 
661 ATTTAGTATT GCTGATTTTT TCCCTACATG GAAAATGATT CATGATATTG ATGGTTCGAA 
721 ATCTAAACTG GTGAAAGCAC ATCGTAAGAT TGATGAAATT TTGGGAAATG TTGTTGATGA 
7 81 GC ACAAAAAG AACAGAGCAG ATGGCAAGAA GGGTAATGGT GAATTTGGTG GTGAAGATTT 
841 GATTGATGTA TTGTTAAGAG TTAGAGAAAG TGGAGAAGTT CAAATTCCTA TCACAAATGA 
901 CAATATCAAA TCAATATTAA TCGACATGTT CTCTGCAGGA TCTGAAACAT CATCGACGAC 
961 TATAATTTGG GCATTAGCTG AAATGATGAA GAAACCAAGT GTTTTAGCAA AGGCACAAGC 
10 21 TGAAGTAAGG CAAGCTTTGA AGGAGAAAAA AGGTTTTCAA CAGATTGATC TTGATGAGCT 
1081 AAAATATCTC AAGTTAGTAA TCAAAGAAAC CTTAAGAATG CACCCTCCAA TTCCTCTATT 
1141 AGTTCCTAGA GAATGTATGG AGGATACAAA GATTGATGGT TACAATATAC CTTTCAAAAC 
1201 AAGAGTCATA GTTAATGCAT GGGCAATCGG ACGAGATCCA GAAAGTTGGG ATGACCCCGA 
1261 AAGCTTTATG CCAGAGAGAT TTGAGAATAG TTCTATTGAC TTTCTTGGAA ATCATCATCA 
13 21 GTTTATACCA TTTGGTGCAG GAAGAAGGAT TTGTCCGGGA ATGCTATTTG GTTTAGCTAA 

13 81 TGTTGGACAA CCTTTAGCTC AGTTACTTTA TCACTTCGAT TGGAAACTCC CTAATGGACA 

14 41 AAGTCATGAG AATTTCGACA TGACTGAGTC ACCTGGAATT TCTGCTACAA GAAAGGATGA 
1501 TCTTGTTTTG ATTGCCACTC CTTATGATTC TTATTAAGCA GTAGCAGAAA TAAAAAGCCG 
1561 GGGCAAACAG AAAAAA 



SEQ. ID. NO. 150 

1 MEIQFSNLVA FLLFLSSIFL, LFKKWKTRKL NLPPGPWKLP FIGSLHHLAV AGPLPHHGLK 
61 NLAKRYGPLM HLQLGQIPTL IISSPQMAKE VLKTHDLAFA TRPKLVAADI IHYDSTDIAF 
121 SPYGEYWRQI RKICILELLS AKHVKFFSSI RQDELSKHLS SIRTTPNLTV NLTDKIFWFT 
181 SSVTCRSALG KICGDQDKLI IFIvfREIISLA GGFSIADFFP TWKMIHDIDG SKSKLVKAHR 
241 KIDEILGNW DEHKKNRADG KKGNGEFGGE DLID\1.LRVR ESGEVQIPIT NDNIKSILID 
3 01 MFSAGSETSS TTIIWALAEM MKKPSVLAKA QAEVKQALKE KKGFQQIDLD ELKYLKLVIK 
3 61 ETLRMHPPIP LLVPRECMED TKIDGYNIPF KTRVIVNAWA IGRDPESWDD PESFMPERFE 
421 NSSI0FLGNH HQFIPFGAGR RICPGMLFGL ANVGQPLAQL LYHFDWKLPN GQSHENFDMT 
481 ESPGISATRK DDLVLIATPY DSY 
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NAME D89-AD2 

ORGANISM NI COT I ANA TABACUM 

SEQ. ID. NO. 151 

1 TCCTTCTTCC TTCCTAAGTC CTAACTAAAA ATGGAGATTC AGTTTTCTAA 
61 TTCTTGCTCT TTCTCTCCAG CATCTTTCTT CTATTCAAAA AATGGAAAAC 
121 AATTTGCCTC CTGGTCCATG GAAATTACCT TTTATTGGAA GTTTACACCA 
181 GCAGGTCCAC TTCCTCACCA TGGCCTAAAA AATTTAGCCA AACGCTATGG 
241 CATTTACAAC TTGGACAAAT TCCTACACTC ATCATATCAT CACCTCA-\AT 
301 GTACTAAAAA CTCACGACCT CGCTTTTGCC ACTAGACCAA AGCTTGTCGT 
3 61 ATTCACTACG ACAGCACGGA CATAGCATTT TCTCCGTACG GTGAATACTG 
421 CGTAAAATTT GCATATTGGA ACTCTTGAGT GCCAAGATGG TCAAATTTTT 
481 CGCCAAGATG AGCTCTCGAA GATGCTCTCA TCTATACGAA CGACACCCAA 
541 AATCTTACTG ACAAAATTTT TTGGTTTACG AGTTCGGTAA CTTGTAGATC 
601 AAGATATGTG GTGACCAAGA CAAATTGATC ATTTTTATGA GGGAAATAAT 
661 GGTGGATTTA GTATTGCTGA TTTTTTCCCT ACATGGAAAA TGATTCATGA 
721 TCGAAATCTA AACTGGTGAA AGCACATCGT AAGATTGATG -^UATTTTGGG 
781 GATGAGCACA AAAAGAACAG AGCAGATGGC AAGAAGGGTA ATGGTGAATT 
841 GATTTGATTG ATGTATTGTT AAGAGTTAGA GAAAGTGGAG AAGTTCAAAT 
901 AATGACAATA TCAAATCAAT ATTAATCGAC ATGTTCTCTG CGGGATCTGA 
961 ACGACTATAA TTTGGGCATT AGCTGAAATG ATGAAGAAAC CAAGTGTTTT 
1021 CAAGCTGAAG TAAGGCAAGC TTTGAAGGAG AAAAAAGGTT TTCAACAGAT 
1081 GAGCTAAAAT ATCTCAAGTT AGTAATCAAA GAAACCTTAA GAATGCACCC 
1141 CTATTAGTTC CTAGAGAATG TATGGAGGAT ACAAAGATTG ATGGTTACAA 
1201 AAAACAAGAG TCATAGTTAA TGCATGGGCA ATCGGACGAG ATCCAGAAAG 
1261 CCCGAAAGCT TTATGCCAGA GAGATTTGAG AATAGTTCTA TTGACTTTCT 
1321 CATCAGTTTA TACCATTTGG TGCAGGAAGA AGGATTTGTC CGGGAATGCT 
1381 GCTAATGTTG GACAACCTTT AGCTCAGTTA CTTTATCACT TCGATTGGAA 
1441 GGACAAAGTC ATGAGAATTT CGACATGACT GAGTCACCTG GAATTTCTGC 
1501 GATGATCTTG TTTTGATTGC CACTCCTTAT GATTCTTATT AAGCAGTAGC 
1561 AGCCGGGOCA AACAGAAAAA A 



CTTAGTTGCA 

CAGAAAACTA 

TTTGGCTGTG 

TCCTCTTATG 

GGCAAAAGAA 

GGCCGACATC 

GAGACA.AATT 

TAGCTCGATT 

TCTTACAGTC 

AGCTTTAGGG 

ATCATTGGCA 

TATTGATGGT 

AAATGTTGTT 

TGGTGGTGAA 

TCCTATCACA 

AACATCATCG 

AGCAAAGGCA 

TGATCTTGAT 

TCCAATTCCT 

TATACCTTTC 

TTGGGATGAC > 

TGGAAATCAT 

ATTTGGTTTA 

ACTCCCTAAT 

TACAAGAAAG 

AGAAATAAAA 



SEQ. ID. m, 152 

1 MEIQFSNLVA FLLFLSSIFL LFKKWKTRKIi NLPPGPWKLP FIGSLHHLAV AGPLPHHGLK 

61 NLAKRYGPLM HLQLGQIPTL IISSPQMAKE VLKTHDLAFA TRPKLWADI IHYDSTDIAF 

121 SPYGE^T^QI RKICILELLS AKMVKFFSSI RQDELSKl^S SIRTTPNLTV NLTDKIFWFT 

181 SSVTCRSALG KICGDQDKLI IFMREIISLA GGFSIADFFP TWKMIHDIDG SKSKLVKAHR 

241 KIDEILGNW DEHKKNRADG KKGNGEFGGE DLIDVLLRVR ESGEVQIPIT NDNIKSILID 

301 MFSAGSETSS TTIIWALAEM MKKPSVLAKA QAEVRQALKE KKGFQQIDLD ELKYLKLVIK 

3 61 ETLRMHPPIP LLVPRECI4ED TKIDGYNIPF KTRVIVNAWA IGRDPESWDD PESFMPERFE 

421 NSSIDFLGNH HQFIPFGAGR RICPGMLFGL ANVGQPLAQL LYHFDWKLPN GQSHENFDMT 

481 ESPGISATRK DDLVLIATPY DSY 
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NAME 
ORGANISM 
SEQ. ID. NO 
1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 



D90A-BB3 
NICOTIANA 
153 
CAACTGCAGT 
TCCTATTTCT 
AAAAATTGCC 
GTGGACTACC 
TTCAATTAGG 
TAAAAACTCA 
GTTACAATAG 
AAATATGTGT 
GCGATGAAGT 
TTACGGAAAG 
TATTCAAGGA 
GGTTTGATGT 
AGGGTAAGAT 
AGCACAAGAA 
TTGATGTTCT 
ACATCAAAGC 
TTGTGTGGGC 
AAGTAAGAGA 
ACTATCTAAA 
TCCCAAGAGA 
AAGTCATGGT 
CTTTTAAGCC 
ATCTTCCATT 
CTTATTTGCC 
AACCAAGCGA 
TTTACTTGGT 



TABACUM 

TTGAAGATAC 
ATCTTTTCTC 
ACCAGGTCCA 
ACACCATGTC 
TGAAGTTTCT 
TGACATCGCT 
GTCTGATCTT 
CTTGGAAGTG 
TCTTCGTCTC 
GATCTTTTTG 
GCAAGACAAA 
GGCTGACATA 
TATGAATGCA 
AAATCTTGCA 
TCTAAAACTT 
TATAATCTTT 
TATGGTGGAA 
TGCATTTAGA 
GTTAGTCATT 
ATGTAGGGAA 
TAATGTTTGG 
AGAGAGATTT 
TGGTGGTGGA 
ATTGGCTCAA 
CTTGGACTTG 
TGCGACTCCT 



CAACTAACCA 
TTTTTGTTAA 
TGGAAACTAC 
CTTAGAGATT 
GCAGTTGTGG 
TTCGCGTCTA 
GCGTTTTGCC 
CTCAGTGCCA 
CTTAATTTTA 
TTCACAAGCT 
TTTATACAAC 
TTCCCTTCAT 
CACCATAAGG 
ATTGGGAAAA 
ATGAATGATG 
GACATGTTTG 
ATGGTGAAAA 
GAAAAAGAAA 
AAAGAAACTC 
GAGACAAATA 
GCATTGGGAA 
GAGCAGTGCT 
AGGAGGATTT 
TTACTTTATC 
ACTGAGTTGG 
TATCAACCTC 



AAATGCAGTT 
GGAAATGGAA 
CAATACTAGG 
TAGCCAAAAA 
TTACTTCTCC 
GGCCTAGCCT 
CCTATGGCGA 
AGAATGTTCG 
TCCGGTCATC 
CCATGACATG 
TAATTAAAGA 
ACAAGTCTCT 
TAGATGCTAT 
CTAATGGAGC 
GAGGCCTTCA 
CTGCTGGAAC 
ATCCAACTGT 
CTTTTGATGA 
TAAGACTTCA 
TAAACGGCTA 
GAGATCCAAA 
CTAAGGATTT 
GTCCAGGGAT 
ACTTTGATTG 
TTGGAGTAAC 
CTCAAAAC 



CTTCAGCTTG 
GAACTCGAAT 
AAGTATGCTT 
ATATGGACCG 
TGATATGGCA 
TTTGGCCCCG 
TTATTGGAGA 
GACATATAGC 
TTCTGGTGAG 
TAGATCAGCG 
AGTTATACTC 
TCATGTGCTC 
TGTTGAGAAT 
GTTAGGAGGT 
ATTTCCTATC 
AGAGACTTCA 
ATTTGCGAAA 
AAATGATGTG 
TCCACCGGTT 
CACTATTCCT 
ATATTGGGAT 
TGTTGGTAAT 
TTCGTTTGGT 
GGAACTCCCC 
TGCCGCTAGA 



GTTTCCATTT 

AGCCAAAGGA 
CATATGGTTG 
CTTATGCACC 
AAAGAAGTAC 
GAGATTGTCT 
CAAATGCGTA 
TCTATTAGGC 
CCTGTTAATA 
TTTGGGCAAG 
TTAGCAGGAG 
AGTGGAATGA 
GTCATCAACG 
GAAGATTTAA 
ACCAACGACA 
TCGTCAACAA 
GCTCAAGCAG 
GAGGAGCTAA 
CCACTTTTGC 
GTAAAGACCA 
GATGCAGAAA 
AATTTTGAAT 
TTAGCTAATG 
ACTGGAATCA 
AAAAGTGACC 



SEQ. ID, NO. 154 

1 MQFFSLVSIF LFLSFLFLLR KWKNSNSQRK KLPPGPWKLP ILGSl-lLHlvIVG GLPHHVLRDL 
61 AKia^GPLMHL QLGEVSAVW TSPDMAKEVL KTHDIAFASR PSLLAPEIVC YmSDLAFCP 
121 YGDYWRQMRK ICVLEVLSAK NVRTYSSIRR DEVLRLLNFI RSSSGEPVNI TERIFLFTSS 
181 MTCRSAFGQV FKEQDKFIQL IKEVILLAGG FDVADIFPSY KSLHVLSGMK GKIMNAHHKV 
241 DAIVENVINE HKKNLAIGKT NGALGGEDLI DVLLKLMNDG GLQFPITNDN IKAIIFDMFA 

3 01 AGTETSSSTI WAMVEl-lVKN PTVFAKAQAE VRDAFREKET FDENDVEELN YLKLVIKETL 
361 RliHPPVPLLL PRECREETNI NGYTIPVKTK VMVNVWALGR DPKYWDDAET FKPERFEQCS 

4 21 KDFVGNNFEY LPFGGGRRIC PGISFGLANA YLPLAQLLYH FDWELPTGIK PSDLDLTELV 
481 GVTAARKSDL YLVATPYQPP QN 
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NAME D95-AG1 

ORGAITISM NICOTIANA TABACUM 

SEQ. ID. NO. 155 

1 AAAAGATGTC TTCATTTTCC ACATCTTCTG CCACTTCTAA TTCCAAACTT CCAGTTCGAG 

61 AAATCCCAGG AGACTATGGT TTCCCCTTTT TTGGAGCCAT AAAAGATAGA TATGACTACT 

121 TCTACAACCT CGGCACAGAC GAATTCTTTC TTACCAAAAT GCAAAAATAC AACTCTACTG 

181 TCTTTAGAAC CAACATGCCA CCAGGTCCAT TCATTGCTAA AAATCCCAAA GTAATTGTTC 

241 TCCTCGATGC CAAAACATTT CCCGTTCTTT TCGACAACTC TAAAGTCGAA AAAATGAACG 

301 TTCTTGATGG CACGTACGTG CCATCTACTG ATTTCTATGG CGGATATCGC CCGTGTGCTT 

3 61 ATCTTGATCC TTCTGAGTCA ACTCATGCCA CACTTAAAGG GTTCTTTTTA TCTTTAATCT 

421 CCCAGCTTCA TAATCAATTT ATTCCTTTAT TTAGAACCTC AATTTCTGGT CTTTTCGCAA 

481 ATCTTGAGAA TGAGATTTCC CAAAATGGCA AAGCGAACTT CAACAATATC AGCGACATTA 

541 TGTCATTCGA TTTTGTTTTT CGTTTGTTAT GTGACAAGAC CAGTCCCCAT GACACAAATC 

601 TTGGCTCTAA TGGACCAAAA CTCTTTGATA TATGGCTGTT GCCTC AACTT GCTCC ATTGT 

661 TTAGTCTAGG TCTAAAATTT GTGCCGAACT TTCTGGAAGA TTTAATGTTG CATACTTTTC 

721 CCTTGCCATT TTTTCTAGTG AGATCGAATT ACCAGAAGCT TTATGATGCT TTTAGCAAGC 

781 ATGCCGAAAG TACACTGAAT GAAGCAGAGA AGAATGGGAT CAAAAGAGAC GAAGCATGCC 

841 ACAACTTAGT TTTTCTTGCA GGTTTCAATG CTTATGGTGG GATGAAAGTT TTATTCCCTG 

901 CACTGATAAA GTGGGTCGCC AATGGAGGAA AGAGTTTACA CACTCGGCTG GCAAATGAAA 

961 TCAGGACAAT TATCAAAGAA GAATGTGGGA CC ATAACTCT ATCAGCAATC AACAAGATGA 

1021 GTTTAGTAAA ATCAGTAGTG TATGAAGTAT TAAGAATTGA ACCTCCAGTT CCATTCCAAT 

1081 ATGGTAAAGC CAAAGAAGAT ATCATAATCC AAAGCCATGA TTCAACTTTC TTAGTCAAGA 

1141 AAGGTGAAAT GATCTTTGGA TATCAGCCTT TTGCTACAAA AGATCCAAAG ATTTTTGACA 

1201 AACCAGAGGA GTTTATTCCG GAGAGGTTCA TGGCCGAAGG GGAAAAATTA TTAAAGTATG 

1261 TGTATTGGTC AAATGCAAGA GAGACAGATG ATCCAACGGT GGACAACAAA CAATGCCCAG 

1321 CGAAAAATCT TGTCGTGCTT TTGTGCAGGT TGATGTTGGT GGAGGTTTTC ATGCGTTACG 

1381 ACACATTCAC AGTGGAGTCA ACAAAGCTCT TTCTTGGGTC ATCAGTAACG TTCACGACTC 

1441 TGGAAAAAGC GAO^TGAGTT TCAGATATCT TAATTGTAGG CTGCAAATAA TAATGTGGTC 
1501 ATTCTGCAAA TTATTGTACT TGTGCTGATG 

SEQ. ID. NO. 156 

1 MSSFSTSSAT SNSKLPVREI PGDYGFPFFG AIKDRYDYFY NLGTDEFFLT KMQKYNSTVF 

61 RTNMPPGPFI AKNPKVIVLL DAKTFPVLFD NSKVEKMNVL DGTYVPSTDF YGGVRPCAYL 

121 DPSESTHATL KGFFLSLISQ LHNQFIPLFR TSISGLFANL ENEISQNGKA NFNNISDIMS 

181 FDFVPRLLCD KTSPHDTNLG SNGPKLFDIW LLPQUVPLFS LGLKFVPNFL EDLMLHTFPL 

241 PFFLVRSNYQ KLYDAFSKHA ESTLNEAEKN GIKRDEACHN LVFLAGFNAY GGMKVLFPAL 

301 IKWVANGGKS LHTRLANEIR TIIKEECGTI TLSAINKMSL VT<SVVYEVLR lEPPVPFQYG 

361 KAKEDIIIQS HDSTFLVKKG EldlFGYQPFA TKDPKIFDKP EEFIPERFMA EGEKLLKYVY 

421 WSNARETDDP TVDNKQCPAK ITLVVLLCRLM LVEVFMRTOT FTVESTKLFL GSSVTFTTLE 
481 KAT 
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NAME D96-AB6 

ORGANISM NICOTIANA TABACUM 

SEQ. ID. NO, 157 

1 CCAAAAATGG AGCTTCAATC TTCTCCTTTC AATTTAATTT CTTTGTTCCT CTTCTTTTCT 
61 TTTCATTTTA TTCTAGTGAA GAAATGGAAT GCCAAAATCC CAAAGTTACC TCCAGGTCCG 
121 TGGAGGCTTC CCTTTATTGG AAGCCTCCAT CACTTGAAGG GAAAACTTCC ACACCATAAT 
181 CTTAGAGATC TAGCGCGAAA ATATGGGCCT CTCATGTACT TACAACTCGG AGAAATTCCT 
241 GTAGTTGTAA TATCTTCGCC ACGTGTAGCA AAAGCTGTAC TAAAAACTCA TGATCTCGCT 
3 01 TTTGCAACTA GACCACGATT CATGTCCTCA GACATTGTGT TTTACAAAAG CAGGGACATC 
361 TCTTTTGCCC CATTTGGTGA TTACTGGAGA CAGATGCGTA AAATATTGAC TCAGG^CTC 
421 CTGAGTAACA AGATGCTCAA GTCATATAGC TTAATCCGAA AGGATGAGCT CTCGAAGCTC 
481 CTCTCATCGA TTCGTTTGGA AACAGGTTCT GCAGTGAACA TAAATGAAAA GCTTCTCTGG 
541 TTTACGAGCT GCATGACCTG TAGATTAGCC TTTGGAAAAA TATGCAATGA TCGGGATGAG 
601 TTGATCATGC TAATTAGGGA GATATTAACA TTATCAGGAG GATTTGATGT GGGTGATTTG 
661 TTCCCTTCCT GGAAATTACT TCATAATATG AGCAACATGA AAGCTAGGTT GACGAATGTA 
721 CACCACA.2^GT ATGATTTAGT TATGGAGAAC ATCATCAATG AGCACCAAGA GAATCATGCA 
7 81 GCAGGGATAA AGGGTAACAA CGAGTTTGGT GGCGAAGATA TGATCGATGC TCTACTGAGG 
841 GCTAAGGAGA ATAATGAGCT TCAATTTCCT ATCGAAAATG ACAACATGAA AGCAGTAATT 
901 CTGGACTTGT TTATTGCTGG AACTGAAACT TCATATACTG CAATTATATG GGCACTATCA 
961 GAATTGATGA AGCACCCAAG TGTGATGGCC AAGGCACAAG CTGAAGTGAG AAAAGTCTTC 
1021 AAAGAAAATG AAAATTTCGA CGAAAATGAT CTTGACAAGT TGCCATACTT AAAATCAGTG 
1081 ATTAAAGAAA CACTAAGGAT GCACCCTCCA GTTCCTTTGT TAGGGCCTAG AGAATGCAGG 
1141 GACCAAACAG AGATCGATGG CTACACTGTA CCTATTAAAG CTAGAGTTAT GGTTAATGCT 
1201 TGGGCGATAG GAAGAGATCC TGAAAGTTGG GAAGATCCTG AAAGTTTCAA ACCGGAGCGA 

12 61 TTTGAAAATA CTTCTGTTGA TCTTACAGGA AATCACTATC AGTTCATTCC TTTCGGTTCA 
1321 GGAAGAAGAA TGTGTCCAGG AATGTCGTTT GGTTTAGTTA ACACAGGGCA TCCTTTAGCC 

13 81 CAGTTGCTCT ATTGCTTTGA CTGGAAACTC CCTGACAAGG TTAATGCAAA TGATTTTCGC 

14 41 ACTACTGAAA CAAGTAGAGT TTTTGCAGCA AGCAAAGATG ACCTCTACTT GATTCCCACA 

15 01 AATCACAGGG AGCAAGAATA GCTTAATTTA ATGGAGTTCT TGGAAGAATT AAAGAAGAAG 
15 61 GGCTATATAG GTGAGATTTT TTGTATGGTT GCA 

SEQ. ID. NO. 158 

1 MELQSSPFNL ISLFLFFSFH FILVKKWNAK IPKLPPGPWR LPFIGSLHHL KGKLPHHNLR 

61 DLARKVGPLM YLQLGEIPW VISSPRVAKA VLKTHDLAFA TRPRFMSSDI VFYKSRDISF 

121 APFGDYWRQM RKILTQELLS ^^a^KSYSLI RKDELSKLLS SIRLETGSAV NINEKLLWFT 

181 SCMTCRLAFG KICNDRDELI MLIREILTLS GGFDVGDLFP SWKLLHNMSN MKARLTNVHH 

241 KYDLVMENII NEHQENHAAG IKGNNEFGGE DMIDALLRAK ENNELQFPIE NDNMKAVILD 

3 01 LFIAGTETSY TAIIWALSEL t»lKHPSVMAKA QAEVRKVFKE NENFDENDLD KLPYLKSVIK 

361 ETLRMHPPVP LLGPRECRDQ TEIDGYTVPI KAKVIWNAWA IGRDPESWED PESFKPERFE 

421 NTSVDLTGNH YQFIPFGSGR Rl^lCPGMSFGL VNTGHPLAQL LYCFDWKIiPD KVNANDFRTT 

481 ETSRVFAASK DDLYLIPTNH REQE 
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NAME D96~AC2 

ORGANISM NICOTIANA TABACUM 

SEQ. ID. NO. 159 

1 CTTCTTCCAA 
61 TTTTCTTTTC 
121 GGTCCGTGOA 
181 CATAATCTTA 
241 ATTCCTGTAG 



301 CTCGCTTTTG 
361 GACATCTCTT 
421 GAACTCCTGA 
481 AAGCTCCTCT 
541 CTCTGGTTTA 
601 GATGAGTTGA 
661 GATTTGTTCC 
721 AATGTACACC 
781 CATGCAGCAG 
841 CTGAGGGCTA 
901 GTAATTCTGG 
961 CTATCAGAAT 
1021 GTCTTCAAAG 
1081 TCAGTGATTA 
1141 TGCAGGGACC 
1201 AATGCTTGGG 
1261 GAGCGATTTG 
1321 GGTTCAGGAA 
1381 TTAGCCCAGT 
1441 TTTCGCACTA 
1501 CCCACAAATC 
1561 AAGAAGGGCT 



aaatggagct 
tttttattct 
ggcttccctt 
gagatctagc 

TTGTAATATC 
CAACTAGACC 
TTGCCCCATT 
GTAACAAGAT 
CATCGATTCG 
CGAGCTGCAT 
TCATGCTAAT 
CTTCCTGGAA 
ACAAGTATGA 
GGATAAAGGG 
AGGAGAATAA 
ACTTGTTTAT 
TGATGAAGCA 
AAAATGAAAA 
AAGAAACACT 
AAACAGAGAT 
CGATAGGAAG 
AAAATACTTC 
GAAGAATGTG 
TGCTCTATTG 
CTGAAACAAG 
ACAGGGAGCA 
ATATAGGTGA 



TCAATCTTCT 
AGTGAAGAAA 
TATTGGAAGC 
GCGAAAATAT 
TTCGCCACGT 
ACGATTCATG 
TGGTGATTAC 
GCTCAAGTCA 
TTTGGAAACA 
GACCTGTAGA 
TAGGGAGATA 
ATTACTTCAT 
TTTAGTTATG 
TAACAACGAG 
TGAGCTTCAA 
TGCTGGAACT 
CCCAAGTGTG 
TTTCGACGAA 
AA.GGATGCAC 
CGATGGCTAC 
AGATCCTGAA 
TGTTGATCTT 
TCCAGGAATG 
CTTTGACTGG 
TAGAGTTTTT 
AGAATAGCTT 
GATTTTTTGT 



CCTTTCAATT 
TGGAATGCCA 
CTCCATCACT 
GGACCTCTCA 
GTAGCAAAAG 
TCCTCAGACA 
TGGAGACAGA 
TATAGCTTAA 
GGTTCTGCAG 
TTAGCCTTTG 
TTAACATTAT 
AATATGAGCA 
GAGAACATCA 
TTTGGTGGCG 
TTTCCTATCG 
GAAACTTCAT 
ATGGCCAAGG 
AATGATCTTG 
CCTCCAGTTC 
ACTGTACCTA 
AGTTGGGAAG 
ACAGGAAATC 
TCGTTTGGTT 
AAACTCCCTG 
GCAGCAAGCA 
AATTTAATGG 
ATGGTTGCA 



TAATTTCTTT 
AAATCCCAAA 
TGAAGGGAAA 
TGTACTTACA 
CTGTACTAAA 
TTGTGTTTTA 
TGCGTAAA.AT- 
TCCGAAAGGA 
TGAACATAAA 
GAAAAATATG 
CAGGAGGATT 
ACATGAAAGC 
TCAATGAGCA 
AAGATATGAT 
AAAATGACAA 
ATACTGCAAT 
CACAAGCTGA 
ACAAGTTGCC 
CTTTGTTAGG 
TTAAAGCTAG 
ATCCTGAAAG 
ACTATCAGTT 
TAGTTAACAC 
ACAAGGTTAA 
AAGATGACCT 
AGTTCTTGGA 



GTrCCTCTTC 
GTTACCTCCA 
ACTTCCACAC 
ACTCGGAGAA 
AACTCATGAT 
CAAAAGCAGG 
ATTGACTCAG 
TGAGCTCTCG 
TGAAAAGCTT 
CAATGATCGG 
TGATGTGGGT 
TAGGTTGACG 
CCAAGAGAAT 
CGATGCTCTA 
CATGAAAGCA 
TATATGGGCA 
AGTGAGAAAA 
ATACTTAAAA 
GCCTAGAGAA 
AGTTATGGTT 
TTTCAAACCG 
CATTCCTTTC 
AGGGCATCCT 
TGCAAATGAT 
CTACTTGATT 
AGAATTAAAG 



SEQ, ID. NO. 160 

1 MELQSSPFNL ISLFLFFSFL FILVKKWNAK IPKLPPGPWR LPFIGSLHHL KGKLPHHNLR 

61 DLARKYGPLM YLQLGEIPW VISSPRVAKA VLKTHDLAFA TRPRFMSSDI VFYKSRDISF 

121 APFGDYWRQM RKILTQELLS NKMLKSYSLI RKDELSKLLS SIRLETGSAV NIl^KLLWFT 

181 SCMTCRLAFG KICNDRDELI MLIREILTLS GGFDVGDLFP SWKIiLHNMSN MKARLTNVHH 

241 KYDLVMENII NEHQENHAAG IKGNNEFGGE DMIDALLRAK ENNELQFPIE NDNMKAVILD 

301 LFIAGTETSY TAIIWALSEL MKHPSVMAKA QAEVRKVFKE NENFDE]>JDLD KLPYLKSVIK 

361 ETLRMHPPVP LLGPRECRDQ TEIDGYTVPI KARVMVNAWA IGRDPESV7ED PESFKPERFE 

421 NTSVDLTGNH YQFIPFGSGR RMCPGMSFGL VNTGHPLAQL LYCFDWKLPD KVNANDFRTT 

481 ETSRVFAASK DDLYLIPTNH REQE 
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NAME D98-AA1 

ORGANISM NICOTIANA TABACUM 

SEQ. ID. NO. 161 

1 CTTTCTTTCT TGTACCGAGA TGGAGTTTCA ACACTTGGTT TCGTTCTTGC TATTCATCTC 
61 CTTCATCTTT CTTCTAATTC .AAAAATGGAG GAAATCGAAA AAGCTGCCAC CTGGTCCGTG 
121 GAGGCTACCT ATTATTGGAA GTGTGCATCA CTTGACAAGT GGAGTACCAC ATCGAGTTCT 
181 CAGAAATTTA TCACAAAAAT TTGGCCCGAT CATGTACTTG CAGCTCGGGG AAGTTCCCAC 

2 41 AGTAGTTGTA TCCTCCCCAC ACATGGCCAA ACAAATTTTA AAAACTCATG ACCTCGCTTT 

3 01 TGCATCTAGG CCAGAAATCA TGATGGGAAA AATTATTTGC TACGATTGTA AGGACATTGC 
361 CTTTTCCCCG TATGGTGATT ATTGGAGACA TATGCGTAAA WGAGCACCT TGGAACTACT 
421 TAGTGCCAAG ATGGTCAAGT CCTTCAGTCC AATTCGTCAA GATGAGCTCT CAAGTCTCCT 
481 ATCATCCATT GAATCAATGG GAAATTTGCC AATCAACTTA GTAGAAAAAC TTTTATGGTT 
541 TATGAATGCC GCGACATGTA GGTCAGCATT TGGGAAAGTG TGTAAAGATC AAAAAGAGTT 
601 GATAACATTG ATTCAACGAG CAGAATCATT ATCTGGTGGA TTCGAGCTGG CTGATTTGTT 
661 CCCTTCGAAG AAGTTTCTAC ATGGTATTAG TGGGATGCGA TCTAAACTAA TGGAAGCTCG 
721 TAACAAGATA GACGCAGTCT TGGACAACAT TATCAATGTG CACAGAGAGA ATCGGGCAAA 
781 TGGAAATAGT TGTAATGGTG AGTCTGGAAC TGTAGATTTC ATCGATGTTT TTCTAAGGGT 
841 CATGGAGAGT GGCGAATTAC CATTTCCGAT AGAAAATGAC AACATCAAAG CAGTTATTCT 
901 TGACATGTTC GTAGCAGGAT CTGACACATC ATCTTCAACC GTTATTTGGG CATTAACAGA 
961 AATGATGAAG AATCCAAAAG TCATGGCTAA AGCACAAGCT GAAGTGAGAG AAGCTTTTAA 

1021 AGGAAAGAAA GCATGTGATG AGGATACTGA TCTTGAAAAG CTTCATTACC TAAATTTAGT 
1081 GATCAAAGAG ACACTCCGAT TACACCCTCC AACTCCTCTA CTTGTCCCGC GAGAATGCAG 
1141 GGAGGAAACA GAGATAGAAG GATTCACTAT ACCATTGAAA AGCAAAGTCT TGGTTAACGT 
1201 ATGGGCAATT GGAAGAGATC CCGAGAATTG GAAAAATCCT GAATGTTTTA TACCAGAGAG 
1261 ATTCGAAAAT AGTTCTATTG AGTTTACTGG AAATCATTTT CAACTTCTTC CGTTTGGCGC 
1321 TGGAAGACGA ATTTGTCCAG GAATGCAATT TGGTTTGGCT CTTGTTACTC TGCCATTGGC 
1381 TCATTTGCTT CACAATTTTG ATTGGAAACT TCCCGAAGGA ATTAATGCAA GGGATTTGGA 
1441 CATGACAGAG GCAAATGGGA TATCTGCTAG AAGAGAAAAA GATCTTTACT TGATTGCTAC 
1501 TCCTTATGTA TCACCTCTTG ATTAACTCTG AAATTTTGCT TTAATGCTGC TTGCTTGCTT 
1561 CACT 

SEQ. ID, NO. 162 

1 MEFQHLVSFL LFISFIFLLI QKWRKSKKLP PGPWRLPIIG SVHHLTSGVP HRVLRNLSQK 

61 FGPIMYLQLG EVPTVWSSP HMAKQILKTH DLAFASRPEI M>iGKIICYDC KDIAFSPYGD 

121 YWKHMRKLST LELLSAKMVK SFSPIRQDEL SSLLSSIESM GMLPINLVEK LLWFMNAATC 

181 RSAFGKVCKD QKELITLIQR AESLSGGFEL ADLFPSKKFL HGISGMRSKL MEARNKIDAV 

241 LDNIINVHRE NRANGNSCNG ESGTVBFIDV FLRVMESGEL PFPIENDNIK AVILDMFVAG 

3 01 SDTSSSTVIW ALTEMMKITPK VMAKAQAEVR EAFKGKKACD EDTDLEKLHY LNLVIKETLR 

361 LHPPTPLLVP RECREETEIE GFTIPLKSKV LVNVWAIGRD PENWKNPECF IPERFENSSI 

421 EFTGNHFQLL PFGAGRRICP GMQFGLALVT LPLAHLLHNF DWKLPEGINA RDLDMTEANG 

481 ISARREKDLY LIATPYVSPL D 
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NAME D98-AG1 

ORGATJISM NICOTIANA TABACUM 

SEQ, ID, NO, 163 

1 CTTTCTTGTA CCGAGATGGA GTTTCAACAC TTGGTrrCGT TCTTGCTATT CATCTCCTTC 
61 ATCTTTCTTC TAATTCAAAA ATGGAGGAAA TCGAAAAAGC TGCCACCTGG TCCGTGGAGG 
121 CTACCTATTA TTGGAAGTGT GCATCACTTG ACAAGTGGAG TACCACATCG AGTTCTCAGA 
181 AATTTATCAC AAAAATTTGG CCCGATCATG TACTTGCAGC TCGGGGAAGT TCCCACAGTA 
241 GTTGTATCCT CCCCACACAT GGCCAAACAA ATTTTAAAAA CTCATGACCT CGCTTTTGCA 
301 TCTAGGCCAG AAATCATGAT GGGAAAAATT ATTTGCTACG ATTGTAAGGA CATTGCCTTT 
361 TCCCCGTATG GTGATTATTG GAGACATATG CGTAAATTGA GCACCTTGGA ACTACTTAGT 
421 GCCAAGATGG TCAAGTCCTT CAGTCCAATT CGTCAAGATG AGCTCTCAAG TCTCCTATCA 
481 TCCATTGAAT CAATGGGAAA TTTGCCAATC AACTTAGTAG AAAAACTTTT ATGGTTTATG 
541 AATGCCGCGA CATGTAGGTC AGCATTTGGG AAAGTGTGTA AAGATCAAAA AGAGTTGATA 
601 ACATTGATTC AACGAGCAGA ATCATTATCT GGTGGATTCG AGCTGGCTGA TTTGTTCCCT 
661 TCGAAGAAGT TTCTACATGG TATTAGTGGG ATGCGATCTA AACTAATGGA AGCTCGTAAC 
721 AAGATAGACG CAGTCTTGGA CAACATTATC AATGTGCACA GAGAGAATCG GGCAAATGGA 
781 AATAGTTGTA ATGGTGAGTC TGGAACTGTA GATTTCATCG ATGTTTTTCT AAGGGTCATG 
841 GAGAGTGGCG AATTACCATT TCCGATAGAA AATGACAACA TCAAAGCAGT TATTCTTGAC 
901 ATGTTCGTAG CAGGATCTGA CACATCATCT TCAACCGTTA TTTGGGCATT AACAGAAACG 
961 ATGAAGAATC CAAAAGTCAT GGC TAAAGCA CAAGCTGAAG TGAGAGAAGC TTTTAAAGGA 
1021 AAGAAAGCAT GTGATGAGGA TACTGATCTT GAAAAGCATC ATTACCTAAA TTTAGTGATC 
1081 AAAGAGACAC TCCGATTACA CCCTCCAACT CCTCTACTTG TCCCGCGAGA ATGCAGGGAG 
1141 GAAACAGAGA TAGAAGGATT CACTATACCA TTGAAAAGCA AAGTCTTGGT TAACGTATGG 
1201 GCAATTGGAA GAGATCCCGA GAATTGGAAA AATCCTGAAT GTTTTATACC AGAGAGATTC 
1261 GAAAATAGTT CTATTGAGTT TACTGGAAAT CATTTTCAAC TTCTTCCGTT TGGCGCTGGA 
1321 AGACGAATTT GTCCAGGAAT GCAATTTGGT TTGGCTCTTG TTACTCTGCC ATTGGCTCAT 
1381 TTGCTTCACA ATTTTGATTG GAAACTTCCC GAAGGAATTA ATGCAAGGGA TTTGGACATG 
1441 ACAGAGGCAA ATGGGATATC TGCTAGAAGA GAAAAAGATC TTTACTTGAT TGCTACTCCT 
1501 TATGTATCAC CTCTTGATTA ACTCTGAAAT TTTGCTTTAA TGCTGCTTGC TTGCTTCACT 

SEQ. ID. NO, 164 

1 MEFQHIiVSFL LFISFIFLLI QKWRKSKKLP PGPWRLPIIG SVUHhTSGVP HRVLRNLSQK 
61 FGPIMYLQLG EVPTVWSSP HMAKQILKTH DLAFASRPEI t^MGKIICYDC KDIAFSPYGD 
121 YWRHMRKLST LELLSAKMVK SFSPIRQDEL SShhSSlESH GNLPINLVEK LLWFMNAATC 
181 RSAFGKVCKD QKELITLIQR AESLSGGFEL ADLFPSKKFL HGISGMRSKL* MEARNKIDAV 
241 LDNIINVHRE InTRANGNSCNG ESGTVDFIDV FLRVMESGEL PPPIENDNIK AVILDMFVAG 
301 SDTSSSTVIW ALTETl-fKNPK VMAKAQAEVR EAFKGKKACD EDTDLEKHHY LNLVIKETLR 
361 LHPPTPI/LVP RECREETEIE GFTIPLKSKV LVNVWAIGRD PEl^JVTKNPECF IPERFENSSI 
421 EFTGNHFQLL PFGAGRRICP GMQFGLALVT LPLAHLLHNF DWKLPEGINA RDLDMTEANG 
481 ISARREKDLY LIATPYVSPL D 
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NAME D100-BE2 
ORGANISM NICOTIANA 
SEQ. ID. NO. 165 

1 CAAAAACAAA 
61 AGCTTTTTAC 
121 ATTTCTTACT 
181 CTTAGCCAAA 

2 41 AGTACTTTTG 

3 01 TTTCGCGAAT 
3 61 GGCTTGGAGT 
421 GTTTTCGACT 
481 GGTTAAGAGG 
541 TTTTGAGCTG 
601 GAATGTGGAG 
661 GATTGGCGGG 
721 ATTGGAGAAA 
781 TAAAGATTGC 
841 GAACAAGAAA 
901 CAAAGATGAA 
961 AGTTGGGACA 

1021 AGCACAAGCT 
1081 CAACAACCTA 
1141 ACCACTACTA 
1201 CGGAGGAACC 
1261 TGAACCAAGA 
1321 CAAAATGATG 
1381 AATGGTTGCC 
1441 AGAATTGGTT 
1501 GGCCAAGTGT 

SEQ. ID. NO. 166 

1 MVKMFTPIIY 
61 ERYGSVLLLE 
121 GDHWRNLRRI 
181 NVM14RTIAGK 
241 IVLQENRDEF 
3 01 RSLMLVLLSA 
3 61 LRCII^TETFR 
421 KPERFQGLDG 
481 TEGTGLTLPK 



TABACUM 



ATTCCAATGG 
ATTATCACAA 
TTCCCCTTTA 
ATCTCCGAAC 
GTTTCTTCAC 
CGTCCTCTTT 
TCGTACGGAG 
CATCGTCTTC 
CTCAATTCCT 
ATGCTCAATG 
GACATTGAGG 
GCGACGAATA 
AGTTTAATTG 
AGAAAAAGAA 
TGTTTAATTG 
ATCATCAGAA 
ATGGAATGGG 
GAAATTGATG 
CCTTACCTAC 
GTCCCACACG 
ATGTTACTTG 
AAGTTTAAAC 
CCTTTTGGTT 
TTGTCATTGG 
GATATGACTG 
AGCCCACGAC 



TTAACATGTT 
AACATTTCTT 
TTGGCCATCT 
GTTATGGCTC 
CATCTGCAGC 
TGATGGCTGG 
ATCATTGGAG 
AAATGCTACA 
CTGCCATAGC 
TTATGATGAG 
AAGCTACGAG 
TTGGCGACTT 
TGTTGCAAGA 
TGGAGAAAGA 
AAGTTTTGTT 
GCCTTATGCT 
CTTTATCATT 
AACATATAGG 
GTTGTATAAT 
AGTCGTCAGA 
TGAATTTGTG 
CAGAAAGATT 
CTGGACGAAG 
GATGTATTAT 
AAGGAACTGG 
CTAAAATGGC 



CACTCCAATT 
ACGCAAACTC 
TTATCTCTTC 

TGTTCTTCTA 
TGAAGAATGC 
AAAACATCTT 
AAATCTGCGA 
TGGGATTCGT 
TGAAAAATCT 
GACAATTGCT 
ATTCAAAGGT 
TTTGCCGGCG 
GAACAGAGAT 
AGGTACTGTT 
AACACTACAA 
TGTTCTATTA 
AATGTTAAAC 
ACATGAACGT 
CAACGAGACA 
GGAAACCACC 
GGC AATTCAC 
TCAAGGACTA 
GAGTTGTCCT 
TCAATGTTTT 
ACTTACTTTG 
TAATCTTCTC 



ATATACGCTC 
AGAAATAATC 
AAAAAACCAC 
CTCGAATTCG 
TTAACAAAAA 
GGATATAATT 
AGGATTACTT 
ATTGATGAAG 
GTGGATATGA 
GGAAAAAGAT 
TTGGTGCAAG 
TTGAAGTTAT 
GAGTTTATGC 
ACTGATTCAG 
GAAAATGAAC 
TCAGCTGGTA 
CACCCTGAAA 
TTAGTGGACG 
TTCCGAATGT 
GTAGGAGGCT 
AATGATCCAA 
GATGGTGTTA 
GGAGAAGGAT 
GATTGGCAAC 
CCTAAAGCTC 
TCTCAGATTT 



CTCTCCTTTT 
CACCAGCTCC 
TTCAACGTAC 
GTTCACGAAA 
ACGATATTAT 
TTACATCTTT 
CAGTTGAGAT 
TGAAATCTAT 
AGTCTATGTT 
ATTACGGTGA 
AGACTTTCAG 
TGGTGAGGAA 
AGGAATTAAT 
AAATTGAAGG 
CGG AATACTA 
CAGATACTTC 
CTCTGAAGAA 
AGTCGGACAT 
ACCCTGCAGG 
ACCGTGTACC 
AGCTATGGGA 
GAGATGGTTA 
TGGCTGTTCG 
GAATCGGCGA 
AACCTTTGGT 
GA 



APLLLAFYII 
FGSRKVLLVS 
TSVEMFSTHR 
RYYGENVEDI 
MQELIKDCRK 
GTDTSVGTME 
I4YPAGPLLVP 
VRDGYKMMPF 
AQPLVAKCSP 



TKHFLRKLRN 
SPSAAEECLT 
LQMLHGIRID 
EEATRFKGLV 
RMEKEGTVTD 
WALSLMLNHP 
HESSEETTVG 
GSGRRSCPGE 
RPKMANLLSQ 



NPPAPFLTFP 
KNBIIFANRP 

EVKSMVICRLN 
QETFRIGGAT 
SEIEGNKKCL 
ETLKKAQAEI 
GYRVPGGTML 
GLAVRMVALS 
I 



FIGHLYLFKK 
LLMAGKHLGY 
SSAIAEKSVD 
NIGDFLPALK 
lEVLLTLQEN 
DEHIGHERLV 
LVNLWAIHl>rD 
LGCIIQCFDW 



PLQRTIAKIS 
NFTSLAWSSY 
MKSMFFELML 
LLVRKLEKSL 
EPEYYKDEII 
DESDINNLPY 
PKLWDEPRKF 
QRIGEELVDM 
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NAME D100A-AC3 
ORGANISM NICOTIANA TABACUM 
SEQ, ID. UO. 167 

1 caaaaacaaa attccaatgg ttaacatgtt cactccaatt atatacgctc ctctcctttt 

61 agctttttac attatcacaa aacatttctt acgcaaactc agaaataacc caccagctcc 

121 atttcttact ttccccttta ttggccatct ttatctcttc aaaaaaccac ttcaacgtac 

181 cttagccaaa atctccgaac gttatggctc tgttcttcta ctcgaattcg gttcacgaaa 

241 agtacttttg gtttcttcac catctgcagc tgaagaatgc ttaacaaaaa acgatattat 

301 TTTCGCGAAT CGTCCTCTTT TGATGGCTGG AAAACATCTT GGATATAATT TTACTTCTTT 

3 61 GGCTTGGAGT TCGTACGGAG ATCACTGGAG AAATCTTCGT AGGATTACTT CAGTTGAGAT 
421 GTTTTCGACT CATCGTCTTC AAATGCTACA TGGAATTCGT ATTGATGAAG TGAAATCTAT 

4 81 GGTTAAGAGG CTCAATTCCT CTGCCATAGC TGAAAAATCT GTGGATATGA AGTCTATGTT 
541 TTTTGAGCTG ATGCTCAATG TTATGATGAG GACAATTGCT GGAAAAAGAT ATTACGGTGA 
601 GAATGTGGAG GACATTGAGG AAGCTACGAG ATTCAAAGGT TTGGTGCAAG Aa\CTTTCAG 
661 GATTGGCGGG GCGACGAATA TTGGCGACTT TTTGCCGGCG TTGAAGTTAT TGGTGAGGAA 
721 ATTGGAGAAA AGTTTAATTG TGTTGCAAGA GAACAGAGAT GAGTTTATGC AGGAATTAAT 
7 81 TAAAGATTGC AGAAAAAGAA TGGAGAAAGA AGGTACTGTT ACTGATTCAG AAATTGAAGG 
841 GAACAAGAAA TGTTTAATTG AAGTTTTGTT AACACTACAA GAAAATGAAC CGGAATACTA 
901 CAAAGATGAA ATCATCAGAA GCCTTATGCT TGTTCTATTA TCAGCTGGTA CAGATACTTC 
961 AGTTGGGACA ATGGAATGGG CTTTATCATT AATGTTAAAC CACCCTGAAA CTCTGAAGAA 

1021 AGCACAAGCT GAAATTGATG AACATATAGG ACATGAACGT TTAGTGGACG AGTCGGACAT 

1081 CAACAACCTA CCTTACCTAC GTTGTATAAT CAACGAGACA TTCCGAATGT ACCCTGCAGG 

1141 ACCACTACTA GTCCCACACG AGTCGTCAGA GGAAACCACC GTAGGAGGCT ACCGTGTACC 

1201 CGGAGGAACC ATGTTACTTG TGAATTTGTG GGCTATTCAC AATGATCCAA AGCTATGGGA 

12 61 TGAACCAAGA AAGTTTAAGC CAGAAAGATT TGAAGGACTA GAAGGTGTTA GAGACGGTTA 
1321 CAAAATGATG CCTTTTGGTT CTGGACGAAG GAGTTGTCCT GGAGAAGGAT TGGCTATTCG 

13 81 AATGGTTGCA TTGTCATTGG GATGTATTAT TCAATGCTTT GATTGGCAAC GACTTGGGGA 
1441 AGGATTGGTT GATAAGACTG AAGGAACTGG ACTTACTTTG CCTAAAGCTC AACCTTTAGT 
1501 GGCCAAGTGT AGCCCACGAC CTATAATGGC TAATCTTCTT TCTCAGATTT GAACATAATT 
1561 GGTTTCTACC AAACATCCCC AAACTAGAAT ATTATTATTG GTTACATATA CAATGTAATC 
1621 AATTTTGAAC CATATTATAT CTCAATGTAT TCCTTTTTAA AAAAAAAAAA AAAAA 



SEQ. ID. NO. 168 

1 MVNMFTPIIY APLLLAFYII TKHFLRKXRN NPPAPFLTFP FIGHLYLFKK PLQRTLAKIS 
61 ERYGSVLLLE FGSRKVLLVS SPSAAEECLT KNDIIFANRP LLMAGKHLGY NFTSLAWSSY 
121 GDHWRNLRRI TSVEMFSTHR LQMLHGIRID EVKSMVKRLN SSAIAEKSVD MKSl^FEU^ 
181 NVMMRTIAGK RYYGEl^TVEDI EEATRFKGLV QETFRIGGAT NIGDFLPALK LLVHKLEKSL 
241 IVLQENRDEF MQELIKDCRK RMEKEGTVTD SEIEGNKKCL lEVLLTLQEN EPEYYKDEII 
3 01 RSLMLVLLSA GTDTSVGTME WALSLMLNHP ETLKKAQAEI DEHIGHERLV DESDINNLPY 
3 61 LRCIINETFR MYPAGPLLVP HESSEETTVG GYRVPGGTML L\^WAIHin3 PKLWDEPRKF 
421 KPERFEGLEG VRDGYKMMPF GSGRRSCPGE GLAIRMVALS LGCIIQCFDW QRLGEGLVDK 
481 TEGTGLTLPK AQPLVAKCSP RPIMANLLSQ I 
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NAME D104A-AE8 
ORGANISM NICOTIANA 
SEQ, ID. NO. 169 

1 CAACACGCTT 
61 TAATAATTAT 
121 AAGCTACGCC 
181 AATCTCGCCG 
241 TGATGATGGA 
3 01 TTTTTCACCT 
3 61 GCCAAGTATT 
421 GTTACCTAAC 
481 AGATGAGAAA 
541 CAGTTCGAGA 
601 TTAACTTAGG 
661 GAACTTGTTC 

7 21 TATTTGGTGC 
781 GTCTAAATAT 

8 41 TTGATGACCA 
901 GAGAAACTGA 
961 AGTCCGAAGA 

1021 TGGATGTAAT 
1081 AGCTTATGAG 
1141 GACTCAACAG 
12 01 TAAAAGAAAC 

12 61 AATCCACCGT 

13 21 CCATTGGGCG 
13 81 TCAAAGAAGG 
1441 GTCGGCGGTC 
1501 ATCTTCTTCA 
1561 TGGATGATAT 
1621 CACGTTTGTT 
1681 AAGAGTGGTG 
1741 GAATTCTTGG 

SEQ. ID. NO. 170 

1 MKEHVQNNMS 
61 DQLTHRGLAK 
121 TYDRADMAFA 
181 GELVFSLTRN 
241 IRLAKARASL 
301 DLQNAIRLTK 
3 61 RKVEESDFEK 
421 RDKNSWEDPE 
481 HCFTWELPDG 



(69,1755) 
TABACUM 



ACTATCTCCT 

GAAAGAGATG 
CATGATATTC 
TAAACGTTTG 
CCAGTTAACT 
TAAAATGGGT 
ACAGGAACAC 
ATATGACAGG 
ACTATGTGTA 
CGAAGCGGAT 
TGAACTTGTT 
TGAAGATGGA 
TTTCAATATA 
TAGACTTGCT 
TATTATTAGA 
TATGGTGGAT 
TTTGCAGAAT 
GTTTGGAGGG 
GAGTCCTGAA 
AAAAGTTGAA 
TCTACGACTT 
CTCCGGCTAC 
TGACAAAAAT 
TGTACCAGAT 
TTGCCCCGGT 
TTGTTTTACT 
TTTTGGACTC 
GTGTCCCCTT 
CTTGTGATTT 
GCACA 



AAATCTCCAC 
GTGCAAAACA 
TACTTCATCG 
CCTCCAGGTC 
CACCGTGGCC 
TATGTTCACA 
GACATCATAT 
GCAGACATGG 
ATGAAACTCT 
TCCATGGTTA 
TTCAGTCTCA 
CAAGGCGAGT 
GCTGATTTTA 
AAGGCTAGAG 
AAGAAAGCTT 
GAGCTTTTAG 
GCTATCAGAC 
ACAGAAACAG 
GATCTTAAAA 
GAATCTGACT 
CACCCTCCAA 
CATATTCCGG 
TCATGGGA-2iG 
TTTAAAGGAG 
ATGCAACTTG 
TGGGAATTGC 
ACTGCTCCAA 
TATTAATTGA 
TTCCACCTTT 



TCAAAAACAA 
ATATGAGCAC 
TCCCTCTCTT 
CAACTGGCTG 
TTGCCAAACT 
AAATTGTAGT 
TTTCGAACCG 
CTTTTGCTGA 
TCAGCCGCAA 
GAATTGTAAC 
CTCGTAATAT 
TCATTAAAAT 
TTCCATGGCT 
CGTCGCTTGA 
ATGTTAATGG 
CTTTTTACAG 
TTACTAAGGA 
TGGCTTCTGC 
AGGTACAACA 
TTGAAAAATT 
TCCCTCTCCT 
CAAAGTCACA 
ATCCTGAAAC 
GTAATTTTGA 
GGCTTTATGC 
CAGATGGTAT 
GAGCTAATCG 
AGAAAAAAGG 
TGGTTAAATA 



AGAAGAGAAA 
TTCTCTTCTT 
CTGCTTATTC 
GCCTCTCATT 
AGCCCAAAAA 
CTCTGGTCCA 
TCCAGCGACC 
CTATGGACTC 
ACGAGCTGAG 
AACCAACACA 
TATCTACAGA 
TATGCAAGAG 
AGGGTGGGTT 
TGGGTTCATT 
CAAAAATGAT 
TGAGGAAGCA 
TAATATCAAA 
AATAGAATGG 
AGAGCTGGCT 
AACATACTTA 
CCTCCATGAG 
TGTTATTATA 
TTATAAACCA 
GTTTATACCA 
ATTGGAAATG 
GAAACCAAGT 
ACTCGTGGCT 
TGGGGCTTTT 
TACGAATTAT 



GATTTAAAAC 
GAAACTTTAC 
CTTCTCTCCA 
GGTAACATGA 
TATGGTGGTG 
GACGAAGCTC 
GTAGCCATAA 
TTCTGGCGGC 
TCATGGGACT 
GGCACAGCTG 
GCTGCTTTTG 
TTTTCGAAGC 
GGTAAGCAGA 
GATTCGATTA 
GGAGGTGATC 
AAAGTAACTG 
GCTATCATCA 
GCCATGGCAG 
AACGTTGTTG 
AGATGTTGTC 
ACCGCCGAGG 
AATTCATTTG 
TCTAGGTTTC 
TTTGGGTCGG 
GCTGTGGCCC 
GAGCTTAAAA 
GTGCCTACTC 
ACTTGCATCA 
TATGATATAC 



TSLLETLQAT 
LAQKYGGVFH 
DYGLFWRQMK 
IIVRAAFGTC 
DGFIDSIIDD 
DNIKAIIMDV 
LTYLRCCLKE 
TYKPSRFLKE 
MKPSELKMDD 



PMIFYFIVPL 
LKMGYVHKIV 
KLCVMKLFSR 
SEDGQGEFIK 
HIIRKKAYVN 
MFGGTETVAS 
TLRLHPPIPL 
GVPDFKGGNF 
IFGLTAPRMT 



FCLFLLSKSR 
VSG PDEARQV 
KRAESWDSVR 
IMQEFSKLFG 
GKNDGGDRET 
AIEWAMAELM 
LLHETAEEST 
EFIPFGSGRR 
RLVAVPTPRL 



RKRLPPGPTG 
LQEHDIIFSN 
DEADSMVRIV 
AFNIADFIPW 
DMVDELLAFY 
RSPEDLKKVQ 
VSGYHIPAKS 
SC PGMQLGLY 
LCPLY 



WPLIGNMMMM 
RPATVAISYL 
TTNTGTAVNL 
LGWVGKQSLN 
SEEAKVTESE 
QELANWGLN 
HVIINSFAIG 
ALEMAVAHLL 



26/141 



wo 2005/038033 



PCT/US2004/034065 



NAME D105-AD6 
ORGAiqiSM NICOTIANA 
SEQ. ID. MO. 171 

1 TGTGCTTGTG 
61 TTGCAATTTC 
121 ATGTGTGGTT 
181 ACTCTTACAA 
241 TGTCTAAGCC 
301 ACAAAACCAT 
361 TCCTGATCAC 
421 AGCCGCTTGG 
4 81 ATAAATGGGC 
541 ATATGCTACC 
601 TTTCACCAAA 
661 CC ATTTCAAG 
721 AAAAAGAACA 
781 GGTTTTTGCC 
841 TGGTATTTGG 
901 ATTTATTGGG 
961 AGAAATTTGG 
1021 GGCAAGAO^C 
1081 ATTGGCAAGA 
1141 ATGACAAGTT 
12 01 ATCCAGCAGG 
12 61 GTTTACCAGC 
1321 TATGGGGAGA 
1381 CAACAAAAGG 
1441 AAAATTTTGC 
1501 TTGAACTCTC 
1561 GTGCTCAATT 
1621 TTTATCATTG 
1681 TGTC 

SEQ, ID. NO. 172 

1 MEIPYYSLKI 

61 EMKKMGEEAM 

121 VLTKNFVYQK 

181 EI^SKLEKW 

241 SRTIYIPGWR 

3 01 KEIQQHGNNK 

3 61 QVFGSREVDY 

421 TMLLQHDTEI 

481 MAMILKTYAF 



TABACUM 



AGTGTGGGAG 
TTCATTTGCA 
AAAACCAAAA 
ATTCTTGTTT 
AATCAATTTC 
CACCAATTAT 
AGACCCGGAA 
CAATCCACTC 
TACACATAGA 
TGCATTCCAA 
CGGAACAGAG 
AACTGCGTTT 
ACTTTCACTA 
AACGAAAAGG 
AATTATTAAG 
AATATTATTG 
TATGAGTATT 
TACTTCATCT 
TAAAGCTAGA 
GAATCAGCTA 
ATATGTGATT 
CGGCGTACAG 
TGATGCAATG 
AAAACTTGTG 
TATGTTAGAG 
TCCATCTTAT 
AATTTTGTAC 
TAATCAACCA 



AAGGCCTTCA 
ATTATCTTTG 
GAATTGGAGA 
GGGGATATGA 
TCTCATGACA 
GGTAAGAATT 
CTTGTAAAGG 
ACAAAGTTGG 
AGGCTTCTCA 
TTTACTGCTA 
ATAGATGTGT 
GGAAGTAGTT 
ATTCTAGAAG 
AACAAAAGGA 
AAAAGGATGA 
GCATCCAATT 
GATGAGGTGA 
TTACTTGTAT 
GAAGAGGTTT 
AAAATAGTAA 
AATCGAATGG 
CTCGTGTTAC 
GAGTTCAATC 
TTTTTTCCAT 
GCTAAAATGG 
GCTCATGCTC 
AAGTTGTAGA 
TATTGAGGGA 



ATATGGAGAT 
TACTAAGATG 
AATACCTCAG 
AAGAGATGAA 
TGATTTGGCC 
GTATTGTGTG 
AGGTGCTAAC 
CAGCAACTGG 
ATCCTGCTTT 
GTGAGATGTT 
GGCCATATTT 
ATGAAGAAGG 
TTTCACGCAC 
TGAAGCAAAT 
GTATGATTGA 
TAAAAGAAAT 
TTGAAGAGTG 
GGACTATGAT 
TGCAAGTGTT 
CTATGATCTT 
TAAACAAAGA 
CAACAATGTT 
CAGAGAGATT 
TTAGTTGGGG 
CAATGGCTAT 
CTCATCCACT 
TATGGTCAAT 
ACATGGTTTG 



ACCATATTAC 
GGCATGGAAA 
ACAGCAGGGT 
GAAAATGGGT 
TAGAGTTATG 
GTTTGGGCCA 
GAAGAATTTC 
AATTGCAGGC 
TCACCTTGAC 
GAGCAAATTG 
ACAAACTTTG 
AAGAAAGATT 
AATATATATT 
ATTTAATGAA 
AAATGGAGAA 
CCAACAACAT 
TAAACTCTTC 
TTTGTTGTGC 
TGGGAGTAGG 
AAACGAGGTC 
AACAAAGTTA 
GTTGCAACAT 
TAGTGATGGA 
TCCAAGAATA 
GATTCTGAAA 
ACTACTTCAA 
TTGGAACTTG 
AGGTTAAATC 



AGCTTAAAAA 
ATCTTGAATT 
TTCAAAGGAA 
GAAGAAGCTA 
CCATTCATCC 
AGACCAGCAG 
GTCTATCAGA 
TATGAAACAG 
AAGTTGAAGC 
GAGAAAGTTG 
ACAAGTGATG 
TTTGACCTTC 
CCAGGATGGA 
GTACGAGCAC 
GCACCTGATG 
GG AAACAACA 
TATTTTGCTG 
AAATATCCTA 
GAAGTTGACT 
TTAAGGTTGT 
GGGAATTTGT 
GATACTGAAA 
ATATCCAAAG 
TGTATTGGGC 
ACCTATGCAT 
CCTCAATATG 
TTATGGAACT 
CTCGTGTGTG 



AISSFAIIFV 
SKPINFSHDM 
PLGNPLTKLA 
SPNGTEIDVW 
FLPTI<Rl<rKRM 
KFGMSIDEVI 
DKLNQLKIVT 
WGDDAMEFNP 
ELSPSYAHAP 



LRWAWKILNY 
IWPRVMPFIH 
ATGIAGYETD 
PYLQTLTSDA 
KQIFNEVRAL 
EECKLFYFAG 
MILITEVLRLY 
ERFSDGISKA 
HPLLLQPQYG 



VWIiKPKELEK 
KTITNYGKNC 
KWATHRRLLN 
ISRTAFGSSY 
VFGIIKKRMS 
QETTSSLLVW 
PAGYVIITRMV 
TKGKLVFFPF 
AQLILYKIi 



YLRQQGFKGN 
IVWFGPRPAV 
PAFHLDKLKH 
EEGRKIFDLQ 
MIENGEAPDD 
TMILLCKYPN 
NKETKLGNLC 
SWGPRICIGQ 



SYKFLFGDMK 
LITDPELVKE 
MLPAFQFTAS 
KEQLSLILEV 
LLGILLuZ^StJL 
WQDKAREEVIi 
LPAGVQLVLP 
NFAMLEAKMA 
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NAME D109-AH8 (14,1697) 

ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 173 

1 CCAGCACCAA GACATGGAGA ATTCCTGGGT AGTTTTAGCC TTAACAGGCC TTCTTACATT 
61 AGTTTTTCTC TCAAAGTTTC TTCATAGTCC TCGTCGTAAA CAAAATCTTC CACCAGGTCC 
121 AAAACCATGG CCTATTGTTG GCAATATACA TCTTCTTGGT TCCACCCCTC ACAGATCCCT 
181 TCACGA.2^CTT GCAAAAAGAT ACGGAGATTT AATGCTACTA AAGTTCGGTT CGCGCAATGT 

2 41 CCTTATTTTA TCCTCCCCAG ATATGGCTAG AGAATTCTTG AAAACAAATG ATGCCATTTG 

3 01 GGCTTCTCGC CCTGAGCTTG CCGCTGGTAA ATATACTGCT TATAATTATT GCGACATGAC 
3 61 JVTGGGCACGT TATGGACCCT TTTGGAGACA AGCAAGGAGG ATCTATCTCA ACGAGATTTT 
421 CAATCCTAAA CGTTTGGATT CATTTGAGTA CATTCGCATA GAGGAAAGGC ATAATTTGAT 
481 TTCACGTCTT TTTGTTCTCT CTGGGAAGCC AATTCTTCTT AGAGACCATT TAACTCGGTA 
541 CACTCTTACA AGTATAAGTA GAACAGTATT GAGTGGAAAA TATTTTAGCG AGTCACCTGG 
601 CCAAAATTCA ATGATAACTT TGAAACAATT GCAGGATATG CTTGATAAGT GGTTTTTGCT 
661 TAATGGTGTG ATCAATATTG GGGACTGGAT ACCTTGGCTT GCTTTCTTGG ATTTGCAGGG 
721 TTATGTCAAG CAAATGAAGG AGTTGCATAG GAACTTCGAC AAATTTCATA ACTTTGTGCT 
7 81 AGATGATCAC AAGGCTAATA GGGGAGAGAA GAACTTTGTG CCAAGAGACA TGGTCGATGT 
841 TTTGCTGCAG CAAGCTGAGG ATCCTAATCT TGAGGTCAAA CTCACCAATG ATTGTGTCAA 
901 GGGTCTAATG CAGGACTTAT TGGCTGGCGG CACGGACACC TCAGCAACAA CCGTTGAATG 
961 GGCTTTTTAT GAACTTCTTA GACAACCTAA GATTATGAAG AAAGCACAAC AAGAGCTAGA 

1021 CCTTGTCATT TCACAGGACA GATGGGTTCA AGAAAAAGAT TACACTCAAC TCCCTTACAT 
1081 TGAGTCAATC ATCAAGGAAA CATTGAGGCT TCACCCAGTA AGCACCATGC TTCCACCGCG 
1141 CATTGCCTTG GAGGATTGTC ATGTAGCAGG CTATGACATA CCTAAAGGTA CAATTTTAAT 
12 01 TGTGAACACT TGGAGTATTG GAAGAAATTC ACAGCATTGG GAGTCACCAG AAGAATTCCT 
1261 TCCGGAGAGG TTTGAAGGGA AGAATATTGG TGTCACAGGA CAACATTTTG CGCTCTTGCC 
1321 ATTTGGCGCG GGCCGGAGAA AGTGCCCAGG ATACAGTCTT GGGATTCGTA TAATTAGGGC 
1381 AACTTTAGCT AACTTGTTGC ATGGATTCAA CTGGAGATTG CCTAATGGTA TGAGTCCAGA 
1441 AGACATTAGC ATGGAAGAGA TTTATGGGCT AATTACACAC CCCAAAGTCG CACTTGACGT 
15 01 GATGATGGAG CCTCGACTTC CCAACCATCT TTACAAATAG TGGATAATTA AAACCATTAA 
1561 AATCGTTTTG TTATATGCAT GTCTCATATT TGTAGTGGTC AAAATGTTTG TTTTCTATCA 
1621 TGGATGTTCA GTGCGAGGTT GGGAATTTCA AGTCATTAAC GTGTGAAAAT ATTTTAAATT 
1681 TAAAAAAAAA AAAAAAA 

SEQ. ID. NO. 174 

1 MENSW^ATLAIi TGLLTLVFLS KFLHSPRRKQ NLPPGPKPWP IVGNIHLLGS TPHRSLHELA 
61 KRYGDLMLLK FGSRNVLILS SPDt-lAREFLK TNDAIVJASRP ELAAGKYTAY NYCDMTWARY 
121 GPFWRQARRI YLNEIFNPKR LDSFEYIRIE ERHNLISRLF VLSGKPILLR DHLTRYTLTS 
181 ISRTVLSGKY FSESPGQNSM ITLKQLQDML DKWFLLNGVI NIGDWIPWLA FLDLQGYVKQ 
241 MKELHRNFDK FHNFVLPDHK ANRGEKNFVP RDMVDVLLQQ AEDPNLEVKI> TNDCVKGLMQ 
301 DLLAGGTDTS ATTVEWAFYE LLRQPKIMKK AQQELDLVIS QDRWVQEKDY TQLPYIESII 
361 KETLRLHPVS TMLPPRIALE DCHVAGYDIP KGTILIVNTW SIGRNSQHWE SPEEFLPERF 
421 EGKNIGVTGQ HFALLPFGAG RRKCPGYSLG IRIIRATLAN LLHGFNWRLP NGMSPEDISM 
481 EEIYGLITHP KVALDVMMEP RLPNHLYK 
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NAME D110-AF12 (166,1631) 

ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 175 

1 ACTGTTCAAA TCACAGTAAC AGCATCTTGT GCTGCCATAA TAATTACTCT 
61 ATATGGAGAG TGCTGAATTG GGTTTGGTTC AGACCAAAGA AGCTGGAAAA 
121 AAACAAGGTC TCAAAGGCAA TTCCTACAGG ATTTTGTATG GGGATATGAA 
181 GGTATGATTA AGGAAGCTAA CTCCAAACCC ATGAATCTTT CTGATGATAT 
241 TTGGTCCCTT TCTTTCTTGA TACCATCAAG AAATATGGGA AAAAATCCTT 
301 GGTCCAAAAC CGCTGGTTTT TGTCATGGAC CCCGAGCTTA TAAAGGAAGT 
361 AACTATCTGT ATCAAAAGCC TCATTCAAAT CCATTAACCA AGTTACTGGC 
421 GTAAGCCAAG AGGAAGACAA ATGGGCCAAA CATAGAAAAA TCGTCACTCC 
481 CTGGAGAAGC TAAAGCATAT GCTTCCAGCT TTTTGTTTGA GCTGTACTGA 
541 AAATGGGAAG ACATTGTTGC AGTTGAGGGC TCACATGAGA TAGATATATG 
601 CAACAATTAA CTAGTGATGT GATCTCTCGG ACAGCCTTTG GCAGTAGCTA 
661 AGAAGGATAT TTGAACTTCA AAAGGAACAA GCTCAATTTC TTATGGAAGC 
721 GTTTATATTC CAGGCTGGAG GTTTTTGCCA ACAAAGAGGA ACAGAAGAAT 
781 GAAAAGGATG TTCAAGCCTT AGTTAGAGGT ATTATTGATA AAAGAGTAAA 
841 GCAGGAGAGG TGAATAATGA GGATCTGCTT GGTATATTGC TGGAATCTAA 
901 ATTGAACAGC ATGGAAACAA GGATTTTGGA ATGAGCATTG AAGAAGTCAT 
961 AAGTTATTCT ATTTTGCTGG CCAAGAAACT ACATCAGTGT TGCTTGTATG 
1021 TTGCTGAGCA GGCATCAGGA TTGGCAAGCA CTGGCCAGAG AAGAGGTGTT 
1081 GGGAATCAGA AACCAGATTT TGATGGATTA AATCGTCTAA AAATTGTTAC 
1141 TACGAGTCTT TAAGGCTCTA TCCCCCAGTA GTGACACTTA CCCGAAGGCC 
1201 ACTGTATTAG GAGATGTATC TCTACCAGCA GGTGTGTTAA TCTCCTTACC 
1261 TTGCATCACG ACGAAGAGAT ATGGGGTAAA GATGCAAAGA AGTTCAAGCC 
1321 AGAGATGGAG TCTCAAGTGC AACAAAGGGT CAAGTCACTT TTTTCCCATT 
1381 CCCAGAATAT GCATTGGACA AAATTTTGCC ATGTTAGAAG CAAAGACTAC 
1441 ATCCTACAAC GCTTCTCCTT TGAACTGTCT CCATCTTATG CACATGCTCC 
1501 ATAACTTTGC AACCCCAGTA TGGTGCTCCA CTTATTTTGC ATAAAATATA 
1561 TGTAAGTAGT GTCTCGTTTT ATGTTAAGCA TGAGTCCAAA ATGTTAAGGC 
1621 GC AAAATGGG A 



AGTGGTGTGT 
GCTACTGAGG 
GGAGC TTTCT 
TGCCCCAAGA 
TGTATGGTTG 
ATTCTCCAAA 
ACAAGGACTT 
TGCCTTCCAC 
GATGCTGAGC 
GCCTGGCCTT 
TGAAGCAGGT 
TATACGCTCC 
GAAGGAAATT 
GTCAATGAAA 
TTTTAAAGAA 
TCAAGAATGC 
GACTCTAATA 
GCAAGTCTTT 
AATGATCTTG 
TAAGGAAGAC 
AGTGATCTTA 
AGAGAGATTC 
TACTTGGGGT 
TTTGGCTATG 
TCAGTCCATA 
GTTTATTACT 
TTGTAGAACT 



SEQ. ID. NO, 176 

1 MKELSGMIKE ANSKPMNLSD DIAPRLVPFF 
61 EVFSKNYLYQ KPHSNPLTKL LAQGLVSQEE 
121 TEMLSKWEDI VAVEGSHEID IWPGLQQLTS 
181 EAIRSVYIPG WRFLPTKRNR RMKEIEKDVQ 
2 41 SNFKEIEQHG ^HCDFGMSIEE VIQECKLFYF 
301 VLQVFGNQKP DFDGLNRLKI VTMILYESLR 
361 LPVILLHHDE EIWGKDAKKF KPERFRDGVS 
421 TTLAMILQRP SFELSPSYAH APQSIITLQP 



LDTIKKYGKK SFVWLGPKPL VFVMDPELIK 
DKWAKHRKIV TPAFHLEKLK HMLPAFCLSC 
DVISRTAFGS SYEAGRRIFE LQKEQAQFLM 
ALVRGIIDKR VKSI-IKAGEVN NEDLLGILLE 
AGQETTSVLL VWTLILLSRH QDWQALAREE 
LYPPWTLTR RPKEDTVLGD VSLPAGVLIS 
SATKGQVTFF PFTWGPRICI GQNFAMLEAK 
QYGAPLILHK I 
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NAME D112~AA5 

ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 177 

1 ATTTATCTCT GAAAATGCAA TTCTTCAGCT TGGTTTCCAT TTTCCTATTC CTATCTTTCC 

61 TATTTTTGTT GAGGAAATGG AAGAACTCCA ATAGCCAAAG CAAAAAATTG CCACCAGGTC 

121 CATGGAAAAT ACCAATACTA GGAAGTATGC TTCATATGAT TGGTGGAGAA CCGCACCATG 

181 TCCTTAGAGA TTTAGCCAAA AAAGATGGAC CACTTATGCA CCTTCAGTTA GGTGAAATTT 

2 41 CTGCAGTTGT GGTTACTTCT AGGGACATGG CAAAAGAAGT GCTAAAAACT CATGACGTCG 
301 TTTTTGCATC TAGGCCTAAA ATTGTAGCCA TGGACATTAT CTGTTATAAC CAGTCCGACA 

3 61 TTGCCTTTAG CCCTTATGGC GACCACTGGA GACAAATGCG TAAAATTTGT GTCATGGAAC 
421 TTCTCAATGC AAAGAATGTT CGGTCTTTCA GCTCCATCAG ACGTGATGAA GTCGTTCGTC 
481 TCATTGACTC TATCCGGTCA GATTCTTCTT CAGGTGAGCT AGTTAATTTT ACGCAGAGGA 
541 TCATTTGGTT TGCAAGCTCC ATGACGTGTA GATCAGCATT TGGGCAAGTA CTCAAGGGGC 
601 AAGACATATT TGCCAAAAAG ATCAGAGAAG TAATAGGATT AGCAGAAGGC TTTGATGTGG 
661 TAGACATCTT CCCTACATAC AAGTTTCTTC ATGTTCTCAG TGGGATGAAG CGTAAACTTT 
721 TGAATGCCCA CCTTAAGGTA GACGCCATTG TTGAGGATGT CATCAACGAG CACAAGAAAA 
7 81 ATCTTGCAGC TGGCAAGAGT AATGGCGCAT TAGGAGGCGA AGATCTAATT GATGTCCTAC 
841 TGAGACTTAT GAATGACACA AGTCTTCAAT TTCCCATCAC CAACGACAAT ATCAAAGCTG 
901 TTGTTGTTGA CATGTTTGCT GCCGGAACAG AAACTTCATC AACAACAACT GTATGGGCCA 
961 TGGCTGAAAT GATGAAGAAT CCAAGTGTAT TCGCCAAAGC TCAAGCAGAA GTGCGAGAAG 

1021 CCTTTAGGGA CAAAGTATCT TTTGATGAAA ATGATGTGGA GGAGCTGAAA TACTTAAAGT 

1081 TAGTCATTAA AGAAACTTTG AGACTTCATC CACCGTCTCC ACTTTTGGTC CCAAGAGAAT 

1141 GCAGGGAAGA TACGGATATA AACGGCTACA CTATTCCTGC AAAGACCAAA GTTATGGTTA 

1201 ATGTTTGGGC ATTGGGAAGA GATCCAAAAT ATTGGGATGA CGCGGAAAGC TTTAAGCCAG 

1261 AGAGATTTGA GCAATGTTCT GTAGATATTT TTGGTAATAA TTTTGAGTTT CTTCCCTTTG 

1321 GCGGGGGACG GAGAATTTGT CCTGGAATGT CATTTGGTTT AGCTAATCTT TACTTACCAT 

13 81 TGGCTCAATT ACTCTATCAC TTTGACTGGA AACTCCCAAC CGGAATCAAG CCAAGAGACT 

1441 TGGACTTGAC CGAATTATCG GGAATAACTA TTGCTAGAAA GGGTGACCTT TACTTAAATG 

1501 CTACTCCTTA TCAACCTTCT CGAGAGTAAT TTACTATTGC CATAAACATT TTAAATTTCC 

1561 TTCATCAACC TC 



SEQ. ID. NO. 178 

1 MQFFSLVSIF LFLSFLFLLR KWKNSNSQSK KLPPGPWKIP ILGSMLHMIG GEPHHVLRDL 
61 AKKDGPLMHL QLGEISAVW TSRDMAKEVL KTHDWFASR PKIVAMDIIC YNQSDIAFSP 
121 YGDHWRQMRK ICVMELLNAK ^JVRSFSSIRR DEWRLIDSI RSDSSSGELV NFTQRIIWFA 
181 SSMTCRSAFG QVLKGQDIFA KKIREVIGLA EGFDWDIFP TVKFLHVLSG MKRKLLNAHL 
241 KVDAIVEDVI NEHKKNLAAG KSNGALGGED LIDVLLRLMN DTSLQFPITN DNIKAVWDM 
3 01 FAAGTETSST TTVWAMAEMM KNPSVFAKAQ AEVREAFRDK VSFDENDVEE LKYLKLVIKE 
3 61 TLRLHPPSPL LVPRECREDT DINGYTIPAK TKVMVNVWAL GRDPKYWDDA ESFKPERFEQ 
421 CSVDIFGNNF EFLPFGGGRR ICPGMSFGLA NLYLPLAQLL YHFDWKLPTG IKPRDLDLTE 
481 LSGITIARKG DLYLNATPYQ PSRE 
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NAME D120-AH4 

ORGANISM NICOTIANA TABACUM 
SEQ, ID. NO. 179 

1 ATAATGCTTT CTCCCATAGA AGCCATTGTA GGACTAGTAA CCTTCACATT TCTCTTCTTC 
61 TTCCTATGGA CAAAAAAATC TCAAAAACCT TCAAAACCCT TACCACCGAA AATCCCCGGA 
121 GGATGGCCGG TAATCGGCCA TCTTTTCCAC TTCAATGACG ACGGCGACGA CCGTCCATTA 
181 GCTCGAAAAC TCGGAGACTT AGCTGACAAA TACGGCCCCG TTTTCACTTT TCGGCTAGGC 
241 CTTCCCCTTG TCTTAGTTGT AAGCAGTTAC GAAGCTGTAA AAGACTGTTT CTCTACAAAT 
301 GACGCCATTT TTTCCAATCG TCCAGCTTTT CTTTACGGCG ATTACCTTGG CTACAATAAT 
3 61 GCCATGCTAT TTTTGGCCAA TTACGGACCT TACTGGCGAA AAAATCGAAA ATTAGTTATT 
421 CAGGAAGTTC TCTCCGCTAG TCGTCTCGAA AAATTCAAAC ACGTGAGATT TGCAAGAATT 
481 CAAGCGAGCA TTAAGAATTT ATATACTCGA ATTGATGGAA ATTCGAGTAC GATAAATTTA 
541 ACTGATTGGT TAGAAGAATT GAATTTTGGT CTGATCGTGA AGATGATCGC TGGAAAAAAT 
601 TATGAATCCG GTAAAGGAGA TGAACAAGTG GAGAGATTTA AGAAAGCGTT TAAGGATTTT 
661 ATGATTTTAT CAATGGAGTO^ TGTGTTATGG GATGCATTTC CAATTCCATT ATTTAAATGG 
7 21 GTGGATTTTC AAGGGCATGT TAAGGCTATG AAAAGGACTT TTAAAGATAT AGATTCTGTT 
7 81 TTTCAGAATT GGTTAGGGGA ACATATTAAT AAAAGAGAAA AAATGGAGGT TAATGCAGAA 
841 GGGAATGAAC AAGATTTCAT TGATGTGGTG CTTTCAAAAA TGAGTA.d^TGA ATATCTTGGT 
901 GAAGGTTACT CTCGTGATAC TGTCATTAAA GCAACGGTGT TTAGTTTGGT cttggatgca 
961 GCAGACACAG TTGCTCTTCA CATAAATTGG GGAATGGCAT TATTGATAAA CAATCAAAAG 
1021 GCCTTGACGA AAGCACAAGA AGAGATAGAC ACAAAAGTTG GTAAGGACAG ATGGGTAGAA 
1081 GAGAGTGATA TTAAGGATTT GGTATACCTC CAAGCTATTG TTAAAGAAGT GTTACGATTA 
1141 TATCCACCAG GACCTTTGTT AGTACCACAC GAAAATGTAG AAGATTGTGT TGTTAGTGGA 
1201 TATCACATTC CTAAAGGGAC AAGATTATTC GC AAACGTCA TGAAACTGCT ACGTGATCCT 
1261 AAACTCTGGC CTGATCGTGA TACTTTCGAT CCAGAGAGAT TCATTGCTAC TGATATTGAC 
13 21 TTTCGTGGTC AGTACTATAA GTATATCCCG TTTGGTTCTG GAAGACGATC TTGTCCAGGG 
13 81 ATGACTTATG CATTGCAAGT GGAACACTTA ACAATGGCAC ATTTGATCCA AGGTTTCAAT 
1441 TACAGAACTC CAAATGACGA GCCCTTGGAT ATGAAGGAAG GTGCAGGCAT AACTATACGT 
1501 AAGGTAAATC CTGTGGAACT GATAATAGCG CCTCGCCTGG CACCTGAGCT TTATTAAAAC 
1561 CTAAGATCTT TCATCTTGGT TGATCATTGT ATAATACTCC TAAATGGATA TTCATTTACC 
1621 TTTTATCAAT TAA 

SEQ. ID. NO. 180 

1 MLSPIEAIVG LVTFTFLFFF LWTKKSQKPS KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 
61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE AVKDCFSTND AIFSNRPAFL YGDYLGYNNA 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK FKHVRFARIQ ASIKNI.YTRI DGNSSTINLT 
181 DWLEEI.NFGL IVKMIAGKITY ESGKGDEQVE RFKKAFKDFM ILSMEFVLWD AFPIPLFKWV 

2 41 DFQGHVKAMK RTFKDIDSVF QNWLGEHINK REKMEVNAEG NEQDFID\A;L SKMSrrEYLGE 

3 01 GYSRDTVIKA TVFSLVLDAA DTVALHINWG MALLINNQKA LTKAQEEIDT KVGKDRWVEE 
361 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE NVEDCWSGY HIPKGTRLFA IJVMKLLRDPK 
421 LWPDPDTFDP ERFIATDIDF RGQVYKYIPF GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP RLAPELY 
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NAME D121-AA8 
ORGAinSM HICOTIANA TABACUM 
SEQ. ID. NO. 181 

1 AATCCATAAT GCTTTCTCCC ATAGAAGCCA TTGTAGGACT AGTAACCTTC ACATTTCTCT 
61 TCTTCTTCCT ATGGACAAAA AAATCTCAAA AACCTTCAAA ACCCTTACCA CCGAAAATCC 
121 CCGGAGGATG GCCGGTAATC GGCCATCTTT TCCACTTCAA TGACGACGGC GACGACCGTC 
181 CATTAGCTCG AAAACTCGGA GACTTAGCTG ACAAATACGG CCCCGTTTTC ACTTTTCGGC 

2 41 TAGGCCTTCC CCTTGTCTTA GTTGTAAGCA GTTACGAAGC TGTAAAAGAC TGTTTCTCTA 

3 01 CAAATGACGC CATTTTTTCC AATCGTCCAG CTTTTCTTTA CGGCGATTAC CTTGGCTACA 
361 ATAATGCCAT GCTATTTTTG GCCAATTACG GACCTTACTG GCGAAAAAAT CGAAAATTAG 
421 TTATTCAGGA AGTTCTCTCC GCTAGTCGTC TCGAA;^AA'rT CAAACACGTG AGATTTGCAA 
481 GAATTCAAGC GAGCATTAAG AATTTATATA CTCGAATTGA TGGAAATTCG AGTACGATAA 
541 ATTTAACTGA TTGGTTAGAA GAATTGAATT TTGGTCTGAT CGTGAAGATG ATCGCTGGAA 
601 AAAATTATGA ATCCGGTAAA GGAGATGAAC AAGTGGAGAG ATTTAAGAAA GCGTTTAAGG 
661 ATTTTATGAT TTTATCAATG GAGTTTGTGT TATGGGATGC ATTTCCAATT CCATTATTTA 
721 AATGGGTGGA TTTTCAAGGG CATGTTAAGG CTATGAAAAG GACTTTTAAA GATATAGATT 
781 CTGTTTTTCA GAATTGGTTA GAGGAACATA TTAATAAAAG AGAAAAAATG GAGGTTAATG 
841 CAGAAGGGAA TGAACAAGAT TTCATTGATG TGGTGCTTTC AAAAATGAGT AATGAATATC 
901 TTGGTGAAGG TTACTCTCGT GATACTGTCA TTAAAGCAAC GGTGTTTAGT TTGGTCTTGG 
961 ATGCAGCAGA CACAGTTGCT CTTCACATAA ATTGGGGAAT GGCATTATTG ATAAACAATC 

1021 AAAAGGCCTT GACGAAAGCA CAAGAAGAGA TAGACACAAA AGTTGGTAAG GACAGATGGG 
1081 TAGAAGAGAG TGATATTAAG GATTTGGTAT ACCTCCAAGC TATTGTTAAA GAAGTGTTAC 
1141 GATTATATCC ACCAGGACCT TTGTTAGTAC CACACGAAAA TGTAGAAGAT TGTGTTGTTA 
1201 GTGGATATCA CATTCCTAAA GGGACAAGAT TATTCGCAAA CGTCATGAAA CTGCAACGTG 

12 61 ATCCTAAACT CTGGTCTGAT CCTGATACTT TCGATCCAGA GAGATTCATT GCTACTGATA 
1321 TTGACTTTCG TGGTCAGTAC TATAAGTATA TCCCGTTTGG TTCTGGAAGA CGATCTTGTC 

13 81 CAGGGATGAC TTATGCATTG CAAGTGGAAC ACTTAACAAT GGCACATTTG ATCCAAGGTT 
1441 TCAATTACAG AACTCCAAAT GACGAGCCCT TGGATATGAA GGAAGGTGCA GGCATAACTA 
1501 TACGTAAGGT AAATCCTGTG GAACTGATAA TAGCGCCTCG CCTGGCACCT GAGCTTTATT 
1561 AAAACCTAAG ATCATCTTGC TTGAT 

SEQ. ID. NO, 182 

1 MLSPIEAIVG LVTFTFLFFP LWTKKSQKPS KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 

61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE AVKDCFSTND AIFSNRPAFL YGDYLGYNNA 

121 MLFLA^IYGPY VmKimKLVIQ EVLSASRLEK FKHVKFARIQ ASIKITLYTRI DGNSSTINLT 

181 DWLEELNFGL IVKI'IIAGKNY ESGKGDEQVE RFKKAFKDFM ILSMEFVLWD AFPIPLFKWV 

241 DFQGHVKA>n< RTFKDIDSVF QtJWLEEHINK REKMEVNAEG NEQDFIDWL SKMSNEYLGE 

3 01 GYSRDTVIKA TVFSLVLDAA DTVALHINWG MALLINNQKA LTKAQEEIDT KVGKDRWVEE 

361 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE rjVEDCWSGY HIPKGTRLFA NVMKLQRDPK 

421 LWSDPDTFDP ERFIATDIDF RGQYYKYIPF GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 

481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP RLAPELY 
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NAME P122-AF10 
ORGANISM NICOTIANA 
SEQ. ID. NO. 183 

1 CTAAAACTCC 
61 TCTCTTCTAC 
121 AATTCCCGGA 
181 CCGTCCATTA 

2 41 CCGGCTAGGC 

3 01 CTCTACAAAT 
361 CTACAATAAT 
421 ATTAGTCATT 
481 TGGTAAAATT 
541 GATAAATCTA 
601 TGGGAAAAAT 

6 61 TAAGGATTTT 
721 GTTCAAATGG 

7 81 AGATTCTGTT 
841 TAATGCACAA 
901 ATATCTTGAT 
961 CTTGGATGCT 

1021 CAATCAACAT 
1081 ATGGGTAGAA 
1141 GTTACGATTA 
12 01 TGTTAGTGGA 
12 61 GCGCGATCCT 
1321 TGATATTGAC 
1381 TTGTCCGGGG 
1441 GGGTTTCAAT 
1501 AACTATACGT 
1561 TTATTAAAAC 

SEQ. ID. NO. 184 

1 MVSPVEAIVG 
61 RKLGDLADKY 
121 MLFLTKYGPY 
181 DWLEELNFGL 
241 DFQGWKAMK 
3 01 GYSRDTVIKA 
361 SDIKDLVYliQ 
421 LWSNPDKFDP 
481 KTPNDEPLDM 



TABACUM 

ATAATGGTTT 
TTCCTATGGC 
GGGTGGCCGG 
GCTCGAAAAC 
CTTCCGCTTG 
GACGCCATTT 
GCCATGCTAT 
CAGGAAGTTC 
CAAACGAGCA 
ACTGATTGGT 
TATGAATCCG 
ATAATTTTAT 
GTGGATTTTC 
TTTCAGAATT 
GGGAATGAAC 
GAAGGTTACT 
GCGGACACAG 
GCCTTGAAGA 
GAGAGTGATA 
TATCCACCAG 
TATCACATTC 
AAACTCTGGT 
TACCGTGGTC 
ATGACTTATG 
TACAAAACTC 
AAAGTAAATC 
CTTAGATGTT 



CTCCCGTAGA 
CCAAAAAATT 
TAATCGGCCA 
TCGGAGACTT 
TGTTAATTGT 
TCTCCAATCG 
TTTTGACAAA 
TCTCTGCTAG 
TTAAGAGTTT 
TAGAAGAATT 
GTAAAGGAGA 
CAATGGAGTT 
AAGGCTATGT 
GGTTAGAGGA 
AAGATTTCAT 
CTCGTGATAC 
TTGCTCTTCA 
AAGCACAAGA 
TTAAGGATTT 
GACCTTTATT 
CTAAAGGGAC 
CAAATCCTGA 
AGCACTATGA 
CATTACAAGT 
CAAATGACGA 
CTGTAGAAGT 
TTATCTTGAT 



AGCCATTGTA 
TCAAATACCT 
TCTTTTCTAC 
AGCTGACAAA 
AAGCAGTTAC 
TCCAGCTTTT 
ATACGGACCT 
TCGTCTCGAA 
ATACACTCGA 
GAATTTTGGT 
TGAACAAGTG 
TGTGTTATGG 
TAAGGCCATG 
ACATGTCAAG 
TGATGTGGTG 
TGTCATAAAA 
CATGAATTGG 
AGAGATCGAT 
GGTCTACCTC 
AGTACCTCAT 
TAGACTATTC 
TAAGTTTGAT 
GTTTATCCCA 
GGAACACCTA 
GCCCTTGGAT 
GACAATTACG 
TGTACTAATA 



GGACTAGTAA 
TCAAAACCAT 
TTCGATGATG 
TACGGCCCGG 
GAAGCTGTAA 
CTTTACGGTG 
TATTGGCGAA 
AAATTGAAGC 
ATTGATGGAA 
CTGATCGTGA 
GAGAGATTTA 
GATGCTTTTC 
AAAAGGACAT 
AAAAGAGAAA 
CTTTCAAAAA 
GCAACAGTGT 
GGAATGGCAT 
AAGAAAGTTG 
CAAGCTATTG 
GAAAATGTAG 
GCGAACGTTA 
CCAGAGAGAT 
TTTGGTTCTG 
ACAATAGCAC 
ATGAAGGAAG 
GCTCGCCTGG 
TATATATGCA 



CCCTTACACT 
TACCACCGAA 
ACGGCGACGA 
TTTTCACTTT 
AAGACTGCTT 
AATACCTTGG 
AAAATAGAAA 
ACGTGAGATT 
ATTCGAGTAC 
AAATGATCGC 
GGAAAGCGTA 
CAATTCCATT 
TTAAGGATAT 
AAATGGAGGT 
TGAGTAATGA 
TTAGTTTGGT 
TACTGATAAA 
GTAAGGAAAG 
TTAAAGAAGT 
AGGATTGTGT 
TGAAATTGCA 
TCTTCGCTGA 
GAAGACGATC 
ATTTGATCCA 
GTGCAGGATT 
CACCTGAGCT 
GAAAAAATTG 



LVTLTLLFVF 

GPVFTFRLGL 
WRKNRKLVIQ 
IVKMIAGKNY 
RTFKDIDSVF 
TVFSLVLDAA 
AIVKEVLRIiY 
ERFFADDIDY 
KEGAGLTIRK 



LWPKKFQIPS 
PLVIilVSSYE 
EVLSASRLEK 
ESGKGDEQVE 
QrJWLEEHVKK 
DTVALHMNWG 
PPGPLLVPHE 
RGQHYEFIPF 
VNPVEVTITA 



KPLPPKIPGG 

AVKDCFSTND 
LKHVKFGKIQ 
RFRKAYKDFI 
REKKEVNAQG 
MALLINNQHA 
NVEDCWSGY 
GSGRRSCPGM 
RJLAPELY 



WPVIGHLFYF 
AIFSNRPAFL 
TSIKSLYTRI 
ILSMEFVLWD 
ITEQDFIDWL 
LKKAQEEIDK 
HIPKGTRLFA 
TYALQVEHLT 



DDDGDDRPLA 
VGEYLG^TINA 
DGNSSTINLT 
AFPIPLFKWV 
SKl-lSNEYLDE 
KVGKERWVEE 
NVMKLQRDPK 
lAHLIQGFNY 
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NAME D12 8-AB7 

ORGANISM NICOTIANA 
SEQ. ID. NO. 185 

1 CGAGGCTCCC 

61 AGACCTTAAT 

121 CAAAGCGTTT 

181 AAGTTGGTGA 

241 TTTTCTTGTT 

301 AAGAAGTTTT 

361 ATATTTTTAC 

421 AAATGAGGAG 

481 GGGGGTGGGA 

541 CTACTAATGG 

601 GGATTATGTT 

661 CTTTGAATGG 

721 TTCCAATTTT 

7 81 AGAGGCTGCA 

841 AGAGCTCGGA 

901 AGGGAGAGAT 

961 TTGAAACAAC 

1021 TCCAAAAGAA 

1081 AACCAGACAC 

1141 GTATGGCAAT 

1201 TTGATATTCC 

1261 CTCATTGGAA 

1321 TTGAGGCCAA 

13 81 CTGGAATTAT 

1441 TTGAGCTGTT 

1501 TCAGTCTCCA 

1561 TGTGATCTTA 

1621 ATT 

SEQ. ID. NO. 186 

1 MDLLLLEKTL 

61 YAKKFGDLFL 

121 VYGEHWRKMR 

181 r-lMYNNMPRIM 

241 ICKEVKEKFXi 

301 ENINVAAIET 

361 IKEALRLRMA 

421 FFEEEKHVEA 

481 TTEKGGQFSL 



TABACUM 



CACCAAAAAA 
TGGTCTTTTC 
TAAGCTTCCT 
TGATTTAAAC 
AAGAATGGGT 
ACACACACAA 
TGGAAAAGGT 
AATTATGACT 
GTTTGAGGTG 
GATCGTATTA 
TGATAGGAGA 
TGAAAGGAGT 
GAGGCCTTTT 
GCTTTTCAAA 
CAGCAATGCC 
CAATGAGGAC 
ATTATGGTCA 
ACTGCGCGAC 
CCACAAGCTT 
TCCTCTATTA 
AGCAGAGAGC 
GAAACCCGAA 
TGGCAATGAC 
ACTTGCATTG 
GCCTCCTCCA 
CATTTTGAAG 
TTAATTAAGG 



TCATTTCTCT 
TTTGCCATTT 
CCAGGACCAA 

CACAGAAATC 
CAACGTAACT 
GGTGTTGAAT 
CAAGATATGG 
GTACCATTTT 
GCAAGTGTAA 
AGGAGGAGAT 
TTTGAGAGTG 
AGATTGGCTC 
TTGAGAGGTT 
GATTACTTTG 
CTAAAATGTG 
AACGTTCTTT 
ATTGAGTGGG 
GAGATTGACA 
CCATACCTTC 
GTCCCACACA 
AAAATCTTGG 
GAGTTCAGAC 
TTCAGATATC 
CCAATTCTTG 
GGCCAGTCGA 
CATTCCACCA 
GGTTCTGAAG 



CGTCTAAAAT 
TAATCGCTTT 
TTCCAGTACC 

TTACTGATTA 
TAGTTGTTGT 
TTGGTTCAAG 
TTTTTACTGT 
TTACTAATAA 
TTGAGGATGT 
TACAATTAAT 
AAGATGATCC 
AAAGTTTTGA 
ATTTGAAGAT 
TTGATGAAAG 
CGATTGATCA 
ACATTGTTGA 
GTATCGCCGA 
CAGTTCTTGG 
AGGCTGTGAT 
TGAACCTTCA 
TTAACGCTTG 
CCGAGAGGTT 
TTCCGTTTGG 
GCATCACTTT 
AGCTCGACAC 
TTGTGTTGAA 
AAATTTGATA 



GGATCTTCTC 
AATTGTCTCT 
AGTTTTTGGT 

TGCCAAAAAA 
GTCATCTCCT 
AACAAGAAAT 
ATATGGTGAA 
AGTTGTGCAA 
GAAAAAAAAT 
GATGTATAAT 
TTTGTTTGTT 
GTATAATTAT 
CTGTAAAGAA 
AAAGAAGCTT 
CATTCTTGAG 
A.\ACATCAAT 
GCTAGTCAAC 
ACCAGGAGTG 
CAAGGAGGCA 
CGACGCAAAG 
GTGGTTAGCT 
CTTTGAAGAG 
CGTTGGTAGG 
GGGACGTTTG 
CACAGAGAAA 
ACCAAGGTCT 
GTGTTGGATA 



TTACTAGAGA 
AAACTTCGTT 
AATTGGCTTC 
TTTGGCGATC 
GAATTAGCTA 
GTTGTGTTTG 
CATTGGAGAA 
CAGTATAGAG 
CCTGAATCTG 
AATATGTTTA 
AAGCTTAAGG 
GGTGATTTTA 
GTTAAGGAGA 
TCAAATACCA 
GCTCAACAGA 
GTTGCTGCAA 
CACCCTCACA 
CAAGTGACTG 
CTTCGTCTCC 
CTTGGCGGGT 
AACAACCCGG 
GAGAAGCATG 
AGGAGCTGCC 
GTTCAGAACT 
GGTGGACAGT 
TTCTGAACTT 
TTAAGGGCGA 



IGLFFAILIA 
LRMGQRNLW 
RIMTVPFFTN 
FDRRFESEDD 
QLFKDYFVDE 
TLWSIEWGIA 
IPLLVPHMNL 
NGNDFRYLPF 
HILKHSTIVL 



LIVSKLRSKR 
VSSPELAKEV 
KWQQYRGGW 
PLFVKLKALN 
RKKLSNTKSS 
ELVNHPHIQK 
HBAKLGGFDI 
GVGRRSCPGI 
KPRSF 



FKLPPGPIPV 
LHTQGVEFGS 
EFEVASVIED 
GERSRLAQSF 
DSNALKCAID 
KLRDEIDTVIi 
PAESKILVNA 
ILALPILGIT 



PVFGNWLQVG 
RTRNWFDIF 
VKKNPESATN 
EYNYGDFIPI 
HILEAQQKGE 
GPGVQVTEPD 
WWLAi^n^PAHW 
LGRLVQNFEL 



DDLNHRNLTD 
TGKGQDMVFT 
GIVLRRRLQL 
LRPFLRGYLK 
INEDNVLYIV 
THKLPYLQAV 
KKPEEFRPER 
LPPPGQSKLD 
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NAME D129-AD10 
ORGAIJISM NICOTIANA 
SEQ. ID, NO. 187 

1 CAACACGCTT 
61 TAATAATTAT 
121 AAGCTACGCC 
181 AATCTCGCCG 
241 TGATGATGGA 
301 TTTTTCACCT 
3 61 GCCAAGTATT 
421 GTTACCTAAC 
481 AGATGAGAAA 
541 CAGTTCGAGA 
601 TTAACTTAGG 
661 GAACTTGTTC 
721 TATTTGGCGC 
781 GTCTAAATAT 
841 TTGATGACCA 
901 GAGAAACTGA 
961 AGTCCGAAGA 
1021 TGGATGTAAT 
1081 AGCTTATGAG 
1141 GACTCAACAG 
1201 TAAAAGAAAC 
1261 AATCCACCGT 
1321 CCATTGGGCG 
1381 TCAAAGAAGG 
1441 GTCGGCGGTC 
1501 ATCTTCTTCA 
1561 TGGATGATAT 
1621 CACGCTTGTT 



TABACUM 

actatctcct 
gaaagagatg 
catgatattc 
taaacgtttg 
ccagttaact 
taaaatgggt 
acaggaacac 
Xtatgacagg 
actatgtgta 
cgaagcggat 
tgaacttgtt 
tgaagatgga 
tttcaatata 
tagacttgct 

TATTATTAGA 
TATGGTGGAT 
TTTGCAGAAT 
GTTTGGAGGG 
GAGTCCTGAA 
AAAAGTTGAA 
TCTACGACTT 
CTCCGGCTAC 
TGACAAAAAT 
TGTACCAGAT 
TTGCCCCGGT 
TTGTTTTACT 
TTTTGGACTC 
GTGTCCCCTT 



AAATCTCCAC 

GTGCAAAACA 

TACTTCATCG 

CCTCCAGGTC 

CACCGTGGCC 

TATGTTCACA 

GACATCATAT 

GCAGACATGG 

ATGAAACTCT 

TCCATGGTTA 

TTCAGTCTCA 

CAAGGCGAGT 

GCTGATTTTA 

AAGGCTAGAG 

AAGAAAGCTT 

GAGCTTTTAG 

GCTATCAGAC 

ACAGAAACAG 

GATCTTAAAA 

GAATCTGACT 

CACCCTCCAA 

CATATTCCGG 

TCATGGGAAG 

TTTAAAGGAG 

ATGCAACTTG 

TGGGAATTGC 

ACTGCTCCAA 

TATTAATTGA 



TCAAAAACAA 

ATATGAGCAC 

TCCCTCTCTT 

CAACTGGCTG 

TTGCCAAACT 

AAATTGTAGT 

TTTCGAACCG 

CTTTTGCTGA 

TCAGCCGC AA 

GAATTGTAAC 

CTCGTAATAT 

TCATTGAAAT 

TTCCATGGCT 

CGTCGCTTGA 

ATGTTAATGG 

CTTTTTACAG 

TTACTAAGGA 

TGGCTTCTGC 

AAGTACAACA 

TTGAAAAATT 

TCCCTCTCCT 

CAAAGTCACA 

ATCCTGAAAC 

GTAATTTTGA 

GGCTTTATGC 

CAGATGGTAT 

GAGCTAATCG 

AGAAAAAAGG 



AGAAGAGAAA 

TTCTCTTCTT 

CTGCTTATTC 

GCCTCTCATT 

AGCCCAAAAA 

CTCTGGTCCA 

TCCAGCGACC 

CTATGGACTC 

ACGAGCTGAG 

AACCAACACA 

TATCTACAGA 

TATGCAAGAG 

AGGGTGGGTT 

TGGGTTCATT 

CAAAAATGAT 

TGAGGAAGCA 

TAGTATCAAA 

AATAGAATGG 

AGGGCTGGCT 

AACATACTTA 

CCTCCATGAG 

TGTTATTATA 

TTATAAACCA 

GTTTATACCA 

ATTGGAAATG 

GAAACCAAGT 

ACTCGTGGCT 

TGGGGCT 



GATTTAAAAC 

GAAACTTTAC 

CTTCTCTCCA 

GGTAACATGA 

TATGGTGGTG 

GACGAAGCTC 

GTAGCCATAA 

TTCTGGCGGC 

TCATGGaA.CT 

GGCACAGCTG 

GCTGCTTTTG 

TTTTCGAAGC 

GGTAAGCAGA 

GATTCGATTA 

GGAGGTGATC 

AAAGTAACTG 

GCTATCATCA 

GCCATGGCAG 

AACGTTGTTG 

AGATGTTGTC 

ACCGCCGAGG 

AATTCATTTG 

TCTAGGTTTC 

TTTGGGTCGG 

GCTGTQGCCC 

GAGCTTAAAA 

GTGCCTACTC 



SEQ. ID. NO. 188 

1 MKEMVQm^mS TSLLETLQAT PMIFYFIVPL FCLFLLSKSR RKFOiPPGPTG WPLIGNMMMM 

61 DQLTHRGIAK LAQKYGGVFH LKMGYVHKIV VSGPDEARQV LQEHDIIFSN RPATVAISYL 

121 TYDRADMAFA DYGLFWRQMR KLCVMKLFSR KRAESWDSVR DEADSMVRIV TTNTGTAVNIi 

181 GELVFSLTRN IIYRAAFGTC SEDGQGEFIE IMQEFSKLFG AFNIADFIPW LGWVGKQSLN 

241 IRLAKARASL DGFIDSIIDD HIIRKKAYVN GKNDGGDRET DMVDELLAFY SEEAKVTESE 

301 DLQNAIRLTK DSIKAII^aDV MFGGTETVAS AIEWAMAELM RSPEDLKKVQ QGLANWGLN 

361 RKVEESDFEK LTYLRCCLKE TLRLHPPIPL LLHETAEEST VSGYHIPAKS HVIINSFAIG 

421 RDKNSWEDPE TYKPSRFLKE GVPDFKGGNF EFIPFGSGRR SCPGMQLGLY ALEMAVAHLL 
481 HCFTWELPDG MKPSELKMDD IFGLTAPRAN RLVAVPTPRL LCPLY 
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NAME D135-AE1 
ORGANISM NICOTIANA 
SEQ. ID. NO, 189 

1 GGGGGATAAG 
€1 AATTATCTTT 
121 AGAATTGGAG 
181 TGGGG AT ATG 
241 CTCTCATGAC 
301 TGGTAAGAAT 
361 ACTTGTAAAG 
421 CACAAAATTG 
481 AAGGCTfTCTC 
541 ATTTACTGCT 
601 GATAGATGTG 
661 TGGCAGTAGT 
721 AATTCTAGAA 
781 GAACAAAAGG 
841 GAAAAGATTG 
901 GGCATCCAAT 
961 TGATGAGGTG 
1021 TTTACTTGTA 
1081 AGAAGAGGTT 
1141 AAAAATAGTA 
1201 TAATCGAATG 

12 61 ACTCTTGTTA 

13 21 GGAGTTCAAT 
13 81 GTTCTTTCCA 
1441 GGCCAAGATG 
1501 TGCTCATGCT 
1561 CAAGTTGTAG 

SEQ. ID. NO. 190 

1 MEIPY^'SLKL 
61 EIKKI-IGEEAM 
121 VLTKNF\r/QK 
181 EMLSKLEKW 
241 SRTIYIPGWR 
3 01 KEIQQHGNNK 

3 61 QVFGSREVDY 
421 TILLQHDTEI 

4 81 MAMILKNYAF 



TABACUM 



AATATGGAGA 
GTACTAAGAT 
AAATGCATCA 
AAAGAGATAA 
ATGATTTGGC 
TGTTTTGTGT 
GAGGTGCTAA 
GCAGCAACTG 
AATCCTGCTT 
TGTGAGATGT 
TGGCCATATC 
TATGAAGAAG 
GTGTCCCGCA 
ATGAAGCAAA 
AGTATGATTG 
TTAAAAGAAA 
ATTGAAGAGT 
TGGACTATGA 
TTGCAAGTGT 
ACTATGATCT 
GTAACCAAAG 
CCAACAATTT 
CCAGAGAGAT 
TTTAGTTGGG 
GCAATGGCTA 
CCTCATCCAC 
AAATGGTCAA 



TACCATATTA 
GGGCATGGAA 
GACAGCAGGG 
AGAAAATGGG 
CTAGAGTCAT 
GGTTTGGGCC 
CGAAGAATTT 
GAATTGCAGG 
TTCACCTTGA 
TGAGCAAATT 
TACAAACTTT 
GAAGAAAGCT 
CAATATACAT 
TATTTAATGA 
AAAATGGAGA 
TCCAACAACA 
GTAAACTCTT 
TTTTGTTGTG 
TTGGAAGTAG 
TAAACGAGGT 
AAACAAAGTT 
TGTTGCAACA 
TTAGTGATGG 
GTCCAAGAAT 
TGATTCTGAA 
TACTACTTCA 
TTTGGAACTT 



CAGCTTAAAA 
AATCTTGAAT 
TTTCA^i^GGA 
TGAAGAAGCT 
GCCCTTCATC 
AAGACCAGCA 
CGTTTATCAG 
CTATGAAACA 
CAAGTTGAAG 
GGAGAAAGTT 
AACAAGTGAT 
TTTTGAACTT 
CCCAGGATGG 
AGTACGAGCG 
AGCTCCTGAT 
TGGAAATAAC 
CTATTTTGCG 
CAAACATCCT 
GGAAGTTGAC 
CTTAAGGTTG 
AGGGAATTTA 
TGATACTGAA 
AATATCCAAA 
ATGTATTGGG 
AAACTATGCA 
ACCTCAATAT 
GTTATGGAAC 



CTTACAATTT 
TATGTGTGGT 
AACTCTTACA 
ATGTCTAAGC 
CACAAAACCA 
GTCCTGATCA 
AAGCCACCTG 
GATAAATGGG 
CATATGCTAC 
GTCTCACCAA 
GCCATTTCAA 
CAAAAGGAAC 
AGGTTTTTGC 
CTGGTATTGG 
GATTTATTGG 
AAGAAATTTG 
GGGCAAGAGA 
AGTTGGCAAG 
TATGACAAGT 
TATCCAGCAG 
TGTTTACCAG 
ATATGGGGAG 
GC AACAAAAG 
CAAAATTTTG 
TTTGAACTCT 
GGTGCTCAAT 
TTTTATCATC 



TTTCATTTGC 
TAAAACCAAA 
AATTCTTGTT 
CAATCAATTT 
TCACCAATTA 
CAGACCCGGA 
GCACTCCACT 
CTACACATAG 
CTGCATTCCA 
ATGGAACAGA 
GAACTGCTTT 
AACTTTCACT 
CAACAAAAAG 
GAATTATTAA 
GTATATTATT 
GTATGAGTAT 
CAACTTCATC 
ATAAAGCTAG 
TGAATCAGCT 
GATATGCGAT 
CTGGGGTACA 
ATGATGCAAT 
GAAAACTTGT 
CTATGTTAGA 
CTCCATCTTA 
TAATTTTGTA 
GTAATCAACC 



TIFSFAIIFV 
SKPINFSHDM 
PPGTPLTKLA 
SPNGTEIDVW 
FLPTKRi^JKRM 
KFGMSIDEVI 
DKLNQLKIVT 
WGDDAMEFNP 
ELSPSYAHAP 



LRWAWKILNY 
IWPRVMPFIH 
ATGIAGYETD 
PYLQTLTSDA 
KQIFNEVRAL 
EECKLFYFAG 
MILNEVLRLY 
ERFSDGISKA 
HPLLLQPQYG 



VWLKPKELEK 
KTITNYGKIJC 
KWATHRRLLN 
ISRTAFGSSY 
VLGIIKKRLS 
QETTSSLLVW 
PAGYAINRMV 
TKGKLVFFPF 
AQLILYKL 



CIRQQGFKGN 
FVWFGPRPAV 
PAFHLDKLKH 
EEGRKLFELQ 
MIENGEAPDD 
TMILLCKHPS 
TKETKLGNLC 
SWGPRICIGQ 



SYKFLFGDMK 
LITDPELVKE 
MLPAFQFTAC 
KEQLSLILEV 
LLGILLASNL 
WQDKAREEVL 
LPAGVQLLLP 
NFAl^EAKMA 
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NAME D141-AD7 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 191 

1 GTCCTAACTA AAAATGGAGA TTCAGTTTTC TAACTTAGTT GCATTCTTGC TCTTTCTCTC 

61 CAGCATCTTT CTTCTATTCA AAAAATGGAA AACCAGAAAA CTAAATTTGC CTCCTGGTCC 
121 ATGGAAATTA CCTTTTATTG GAAGTTTACA CCATTTGGCT GTGGCAGGTC CACTTCCTCA 
181 CCATGGCCTA AAAAATTTAG CCAAACGCTA TGGTCCTCTT ATGCATTTAC AACTTGGACA 

2 41 AATTCCTACA CTCATCATAT CATCACCTCA AATGGCA.AAA GAAGTACTAA AAACTCACGA 

3 01 CCTCGCTTTT GCCACTAGAC CAAAGCTTGT CGTGGCCGAC ATCATTCACT ACGACAGCAC 
3 61 GGACATAGCA TTTTCTCCGT ACGGTGAATA CTGGAGACAA ATTCGTAAAA TTTGCATATT 
421 GGAACTCTTG AGTGCCAAGA TGGTCAAATT TTTTAGCTCG ATTCGCCAAG ATGAGCTCTC 
481 GAAGATGCTC TCATCTATAC GAACGACACC CAATCTTACA GTCAATCTTA CTGACAAAAT 
541 TTTTTGGTTT ACGAGTTCGG TAACTTGTAG ATCAGCTTTA GGGAAGATAT GTGGTGACCA 
601 AGACAAATTG ATCATTTTTA TGAGGGAAAT AATATCATTG GCAGGTGGAT TTAGTATTGC 
661 TGATTTTTTC CCTACATGGA AAATGATTCA TGATATTGAT GGTTCGAAAT CTAAACTGGT 
721 GAAAGCACAT CGTAAGATTG ATGAAATTTT GGGAAATGTT GTTGATGAGC ACAAAAAGAA 
781 CAGAGCAGAT GGCAAGAAGG GTAATGGTGA ATTTGGTGGT GAAGATTTGA TTGATGTATT 
841 GTTAAGAGTT AGAGAAAGTG GAGAAGTTCA AATTCCTATC ACAAATGACA ATATCAAATC 
901 AATATTAATC GACATGTTCT CTGCGGGATC TGAAACATCA TCGACGACTA TAATTTGGGC 
961 ATTAGCTGAA ATGATGAAGA AACCAAGTGT TTTAGCAAAG GCACAAGCTG AAGTAAGGCA 

1021 AGCTTTGAAG GAGAAAAAAG GTTTTCAACA GATTGATCTT GATGAGCTAA AATATCTCAA 
1081 GTTAGTAATC AAAGAAACCT TAAGAATGCA CCCTCCAATT CCTCTATTAG TTCCTAGAGA 
1141 ATGTATGGAG GATACAAAGA TTGATGGTTA CAATATACCT TTCAAAACAA GAGTCATAGT 
1201 TAATGCATGG GCAATCGGAC GAGATCCAGA AAGTTGGGAT GACCCCGAAA GCTTTATGCC 

12 61 AGAGAGATTT GAGAATAGTT CTATTGACTT TCTTGGAAAT CATCATCAGT TTATACCATT 
1321 TGGTGCAGGA AGAAGGATTT GTCCGGGAAT GCTATTTGGT TTAGCTAATG TTGGACAACC 

13 81 TTTAGCTCAG TTACTTTATC ACTTCGATTG GAAACTCCCT AATGGACAAA GTCATGAGAA 
1441 TTTCGACATG ACTGAGTCAC CTGGAATTTC TGCTACAAGA AAGGATGATC TTGTTTTGAT 
1501 TGCCACTCCT TATGATTCTT ATTAAGCAGT AGCAGAAATA AAAAGCCGGG GCAAACAGAA 
1561 AAAAGT 

SEQ. ID. NO. 192 

1 MEIQFSNLVA FLLFLSSIFL LFKKWKTFOOi NLiPPGPWKLP FIGSLHHLAV AGPLPHHGLK 
61 NLAKRYGPLM HLQLGQIPTL IISSPQMAKE VLKTHDLAFA TRPKLWADI IHYDSTDIAF 
121 SPYGE'/VmQI RKICILELLS AKMVKFFSSI RQDELSKMLS SIRTTPNLTV NLTDKIFWFT 
181 SSVTCRSALG KICGDQDKLI IFMREIISLA GGFSIADFFP TWK14IHBIDG SKSKLVKAHR 
241 KIDEILGNW DEHKKNRADG KKGNGEFGGE DLIDVLLRVR ESGEVQIPIT NDNIKSILID 
3 01 MFSAGSETSS TTIIWALAEM MKKPSVLAKA QAEVRQALKE KKGFQQIDLD ELKYLKLVIK 
361 ETLRMHPPIP I/LVPRECMED TKIDGYNIPF KTRVIVNAWA IGRDPESWDD PESFMPERFE 
421 NSSIDFLGNH HQFIPFGAGR RICPGMLFGL ANVGQPLAQL LYHFDWKLPN GQSHENFDMT 
481 ESPGISATRK DDLVLIATPY DSY 



37/141 



wo 2005/038033 



PCT/US2004/034065 



NAME D147-AD3 
ORGANISM NICOTIANA TABACUM 
SEQ. ID, NO, 193 

1 CAACTAACAA ACACATTGAG TCCTCTCCCA AATCACTGAT TCACCACCAA AAGTACCAAC 
61 AATTCAATGG AAGGTACAAA CTTGACTACA TATGCAGCAG TATTTCTTGA TACTCTGTTT 
121 CTTTTGTTCC TTTCCAAACT TCTTCGCCAG AGGAAACTCA ATTTACCTCC AGGCCC AAAA 
181 CCATGGCCGA TCATCGGAAA CTTAAACCTT ATTGGCAATC TTCCTCATCG CTCAATCCAC 
241 GAACTCTCCC TCAAGTACGG ACCCGTTATG CAACTCCAAT TCGGGTCTTT CCCCGTTGTA 
301 GTTGGATCCT CCGTCGAAAT GGCTAAGATT TTCCTCAAAT CCATGGATAT TAACTTTGTA 
361 GGCAGGCCTA AAACGGCTGC CGGAAAATAC ACAACGTACA ATTAXTCCGA TATTACATGG 
421 TCTCCTTACG GACCATATTG GCGCCAGGCA CGTAGGATGT GCCTAACGGA ATTATTCAGC 
481 ACGAAACGTC TCGATTCATA CGAGTATATT CGGGCTGAGG AGTTGCATTC TCTTCTCCAT 
541 AATTTGAACA AAATATCAGG GAAACCAATT GTGTTGAAAG ATTATTCGAC GACGTTGAGT 
601 TTAAATGTTA TTAGCAGGAT GGTACTGGGG AAAAGGTATT TGGACGAATC CGAGAACTCG 
661 TTCGTGAATC CTGAGGAATT TAAGAAGATG TTGGACGAAT TGTTTTTGCT AAATGGTGTA 
721 CTTAATATTG GAGATTCAAT TCCATGGATT GATTTCATGG ATTTGCAAGG TTATGTTAAG 
7 81 AGGATGAAAG TAGTGAGCAA GAAATTCGAC AAGTTTTTAG AGCATGTTAT TGATGAGCAT 
841 AACATTAGGA GAAATGGAGT GGAGAATTAT GTTGCTAAGG ATATGGTGGA TGTTTTGTTG 
901 CAGCTCGCTG ATGATCCGAA GTTGGAAGTT AAGCTGGAGA GACATGGAGT CAAAGCATTC 
961 ACTCAGGATA TGCTGGCTGG TGGAACCGAG AGTTCAGCAG TGACAGTGGA GTGGGCAATT 
1021 TCAGAGCTGC TAAAGAAGCC GG AGATTTTC AAAAAGGCTA CAGAAGAATT GGATCGAGTA 
1081 ATTGGGCAGA ATAGATGGGT ACAAGAAAAG GACATTCCAA ATCTTCCTTA CATAGAGGCA 
1141 ATAGTCAAAG AGACTATGCG ACTGCACCCC GTGGCACCAA TGTTGGTGCC ACGTGAGTGT 
1201 CGAGAAGATA TTAAGGTAGC AGGCTACGAC GTTCAGAAAG GAACTAGGGT TCTCGTGAGT 
1261 GTATGGACTA TTGGAAGAGA CCCTACATTG TGGGACGAGC CTGAGGTGTT CAAGCCGGAG 
1321 AGATTCCATG AAAGGTCCAT AGATGTTAAA GGACATGATT ATGAGCTTTT GCC ATTTGGA 
1381 GCGGGGAGAA GAATGTGCCC GGGTTATAGC TTGGGGCTCA AGGTGATTCA AGCTAGCTTA 
1441 GCTA^ivTCTTC TACATGGATT TAACTGGTCA TTGCCTGATA ATATGACTCC TGAGGACCTC 
1501 AACATGGATG AGATTTTTGG GCTCTCTACA CCTAAAAAAT TTCCACTTGC TACTGTGATT 
1561 GAGCCAAGAC TTTCACCAAA ACTTTACTCT GTTTGATTCA GCAGTTCTAT GGTTCCGTCA 
1621 AGATAGACTT TGTTACGTTT GAACCTGTGC TO 



SEQ. ID. NO. 194 

1 MEGTNLTTYA AVFLDTLFLL FLSKLLRQRK LNLPPGPKPW PIIGNLNLIG NLPHRSIHEL 
61 SLKYGPVMQL QFGSFPVWG S£P/EhLAKIFL KSMDINFVGR PKTAAGKYTT \TJYSDITWSP 
121 YGPYWRQARR MCLTELFSTK RJLDSYEYIRA EELHSLLHNL NKISGKPIVL KDYSTTLSLN 
181 VISRMVLGKR YLDESENSFV NPEEFKKMLD ELFLLNGVLN IGDSIPWIDF MDLQGYVKRM 

2 41 KWSKKFDKF LEHVIDEHNI RRNGVENYVA KDMVDVLLQL ADDPKLEVKL ERHGVKAFTQ 

3 01 DMLAGGTESS AVTVEWAISE LLKKPEIFKK ATEELDRVIG QNRWVQEKDI PNLPYIEAIV 
3 61 KETMFaHPVA PMLVPRECRE DIKVAGYDVQ KGTRVLVSVW TIGRDPTLWD EPEVFKPERF 
421 HERSIDVKGH DYELLPFGAG RRi^CPGYSLG LKVIQASLAN LLHGFNWSLP DNMTPEDLNM 
481 DEIFGLSTPK KFPLATVIEP RLSPKLYSV 
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NAME D163-AF12 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 195 

1 CTTCTTCCTT CCTAACTAAA AATGGAGATT CAGTTTTCTA ACTTAGTTGC ATTCTTGCTC 
61 TTTCTCTCCA GCATCTTTCT TGTATTCAAA AAATGGAAAA CCAGAAAACT AAATTTGCCT 
121 CCTGGTCCAT GGAAATTACC TTTTATTGGA AGTTTACACC ATTTGGCTGT GGCAGGTCCA 
181 CTTCCTCACC ATGGCCTAAA AAATTTAGCC AAACGCTATG GTCCTCTTAT GCATTTACAA 
241 CTTGGACAAA TTCCTACACT CGTCATATCA TCACCTCAAA TGGCAAAAGA AGTACTAAAA 
301 ACTCACGACC TCGCTTTTGC CACTAaACCA AAGCTTGTCG TGGCCGACAT CATTCACTAC 
361 GACAGCACGG ACATAGCATT TTCGCCATAC GGTGAATACT GGAGACAAAT TCGTAAAATT 
421 TGCATATTGG AACTCTTGAG TGCCAAGATG GTCAAGTTTT TTAGCTCGAT TCGCCAAGAT 
481 GAGCTCTCGA AGATGGTTTC ATCTATACGA ACGACGCCCA ATCTTCCAGT CA-ATCTTACC 
541 GACAAGATTT TTTGGTTTAC GAGTTCGGTA ATTTGTAGAT CAGCTTTAGG GAAGATATGT 
601 GGTGACCAAG ACAAATTGAT CATTTTTATG AGGGAAATAA TATCATTGGC AGGTGGATTT 
661 AGTATTGCTG ATTTTTTCCC TACATGGAAA ATGATTCATG ATATTGATGG TTCAAAATCT 
721 AAACTGGTGA AGGCACATCG TAAGATTGAT GAAATTTTGG AAAATGTGGT AAATGAGCAC 
781 AAACAGAATC GAGCAGATGG TAAAAAGGGT AATGGTGAAT TTGGTGGAGA AGATCTGATT 
841 GATGTTTTGT TAAGAGTTAG AGAAAGTGGA GAAGTTCAAA TTCCAATCAC AGATGACAAT 
901 ATCAAATCAA TATTAATCGA CATGTTCTCT GCCGGATCGG AAACATCATC GACAACTATA 
961 ATTTGGGCAT TAGCTGAAAT GATGAAGAAA CCAAGTGTTT TAGCAAAGGC ACAAGCTGAA 
1021 GTGAGGCAAG CTTTGAAGGG GAAGAAAATT AGTTTTCAAG AGATTGATAT TGATAAGCTA 
1081 AAGTATTTGA AGTTAGTGAT CAAAGAAACT TTAAGAATGC ACCCTCCAAT TCCTCTGTTA 
1141 GTCCCTAGAG AATGTATGGA AGATACAAAG ATTGATGGTT ACAATATACC TTTCAAAACA 
1201 AGAGTCATTG TTAATGCATG GGCAATTGGA CGAGATCCTC AAAGTTGGGA TGATCCTGAA 
1261 AGCTTTACGC CAGAGAGATT TGAGAATAAT TCTATTGATT TTCTTGGAAA TCATCATCAA 
1321 TTTATTCCAT TTGGTGCAGG AAGAAGGATT TGTCCTGGAA TGCTATTTGG TTTAGCTAAT 
13 81 GTTGGACAAC CTTTAGCTCA GTTACTTTAT CACTTCGATT GGAAACTCCC TAATGGACAA 
1441 AGTCATGAGA ATTTCGACAT GACTGAGTCA CCTGGAATTT CTGCTACAAG AAAGGATGAT 
1501 CTTGTTTTGA TTGCCACTCC TTATGATTCT TATTAAGCAG TAGCAGAAAT AAAAAGCCGG 
1561 GGCAAACAGA AAAAAGTATT GCTGCTTCTA GGTATTTTCT ATTGGATAAA TTTCAAAATT 
1621 CATCCACAAT ATTTAGTGTT TGCTAGAGTT GGTTAGC 

SEQ. ID. NO. 196 

1 MEIQFSNLVA FLLFLSSIFL VFKKWKTRKL ^a»PPGPWKIiP FIGSLHHLAV AGPLPHHGLK 

61 NLAKRYGPLM HLQLGQIPTL VISSPQMAKE VLKTHDLAFA TRPKLWADI IHYDSTDIAF 

121 SPYGEYWRQI RKICILELLS AKMVKFFSSI RQDELSKMVS SIRTTPNLPV tJLTDKIFWFT 

181 SSVICRSALG KICGDQDKLI IFMREIISLA GGFSIADFFP TWKMIHDIDG SKSKLVKAHR 

2 41 KIDEILENW ^TEHKQ^mADG KKGNGEFGGE DLIDVLLRVR ESGEVQIPIT DDNIKSILID 

3 01 MFSAGSETSS TTIIWALAEM t^mKPSVLAKA QAEVRQALKG KKISFQEIDI DKLKYLKLVI 
3 61 KETLRMHPPI PLLVPRECME DTKIDGYNIP FKTRVIVIJAW AIGRJDPQSWD DPESFTPERF 
421 ENNSIDFLGN HHQFIPFGAG RRICPGMLFG LANVGQPLAQ LLYHFDWKJLP NGQSHENFDM 
481 TESPGISATR KDDLVLIATP YDSY 
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NAME D163-AG11 
ORGANISM NICOTIANA 
SEQ» ID. NO. 197 

1 CTTCTTCCTT 
61 TTTCTCTCCA 
121 CCTGGTCCAT 
181 CTTCCTCACC 
241 CTTGGACAAA 
3 01 ACTCACGACC 
361 GACAGCACGG 
421 TGCATATTGG 
481 GAGCTCTCGA 
541 GACA-^VGATTT 
601 GGTGACCAAG 
661 AGTATTGCTG 
721 AAACTGGTGA 
781 AAACAGAATC 
841 GATGTTTTGT 
901 ATCAAATCAA 
961 ATTTGGGCAT 
1021 GTGAGCCAAG 
1081 AAGTATTTGA 
1141 GTCCCTAGAG 
12 01 AGAGTCATTG 
12 61 AGCTTTACGC 
1321 TTTATTCCAT 
1381 GTTGGACAAC 
1441 ACTCACCAAA 
1501 CTTATTTTGA 
1561 AATTTTCAAA 

SEQ. ID. NO. 198 

1 MEIQFSNLVA 

61 tJLAKRYGPLM 

121 SPYGEYWRQI 

181 SSVICRSALG 

2 41 KIDEILENW 

3 01 MFSAGSETSS 

3 61 KETLRMHPPI 
421 ENNSIDFLGN 

4 81 TESPGISATR 



TABACUM 



CCTAACTAAA 
GCATCTTTCT 
GG AAATTACC 
ATGGCCTAAA 
TTCCTACACT 
TCGCTTTTGC 
ACATAGCACT 
AACTCTTGAG 
AGATGGTTTC 
TTTGGTTTAC 
ACAAATTGAT 
ATTTTTTCCC 
AGGCACATCG 
GAGCAGATGG 
TAAGAGTTAG 
TATTAATCGA 
TAGCTGAAAT 
CTTTGAAGGG 
AGTTAGTGAT 
AATGTATGGA 
TTAATGCATG 
CAGAGAGATT 
TTGGTGCAGG 
CTTTAGCTCA 
ATTTCGACAT 
TTGCCACTCC 
ATTCATCCAC 



AATGGAGATT 
TGTATTCAAA 
TTTTATTGGA 
AAATTTAGCC 
CGTCATATCA 
CACTAGACCA 
TTCGCCATAC 
TGCCAAGATG 
ATCTATACGA 
GAGTTCGGTA 
CATTTTTATG 
TACATGGAAA 
TAAGATTGAT 
TAAAAAGGGT 
AGAAAGTGGA 
CATGTTCTCT 
GATGAAGAAA 
GAAGAAAATT 
CAAAGAAACT 
AGATACAAAG 
GGC AATTGGA 
TGAGAATAAT 
AAGAAGGATT 
GTTACTTTAT 
GACTGAGTCA 
TGCTCATTCT 
AATATATAGT 



CAGTTTTCTA 
AAATGGAAAA 
AGTTTACACC 
.\AACGCTATG 
TCACCTCAAA 
AAGCTTGTCG 
GGTGAATACT 
GTCAAGTTTT 
ACGACGCCCA 
ATTTGTAGAT 
AGGGAAATAA 
ATGATTCATG 
GAAATTTTGG 
AATGGTGAAT 
GAAGTTCAAA 
GCCGGATCGG 
CCAAGTGTTT 
AGTTTTCAAG 
TTAAGAATGC 
ATTGATGGTT 
CGAGATCCTC 
TCTATTGATT 
TGTCCTGGAA 
CACTTCGATT 
CCTGGAATTT 
TGATTAAGTA 
GTTTGCTAGA 



ACTTAGTTGC 
CCAGAAAACT 
ATTTGGCTGT 
GTCCTCTTAT 
TGGCAAAAGA 
TGGCCGACAT 
GGAGACAAAT 
TTAGCTCOAT 
ATCTTCCAGT 
CAGCTTTAGG 
TATCATTGGC 
ATATTGATGG 
AAAATGTGGT 
TTGGTGGAGA 
TTCCAATCAC 
AAACATCATC 
TAGCAAAGGC 
AGATTGATAT 
ACCCTCCAAT 
ACAATATACC 
AAAGTTGGGA 
TTCTTGGAAA 
TGCTATTTGG 
GGAAACTCCC 
CTGCTACAAG 
TTGCTGCTTT 
GTTGGTTAGC 



ATTCTTGCTC 

AAArrrGCCT 

GGCAGGTCCA 
GCATTTACAA 
AGTACTAAAA 
CATTCACTAC 
TCGTAAAATT 
TCGCCAAGAT 
CAATCTTACC 
GAAGATATGT 
AGGTGGATTT 
TTCAAAATCT 
AAATGAGCAC 
AGATCTGATT 
AGATGACAAT 
GACAACTATA 
ACAAGCTGAA 
TGATAAGCTA 
TCCTCTGTTA 
TTTCAAAACA 
TGATCCTGAA 
TCATCATCAA 
TTTAGCTAAT 
TAATGGACAA 
AAAGGATGAT 
TCTATTGGAG 



FLLFLSSIFL 
HLQLGQIPTL 
RKICILELLS 
KICGDQDKLI 
NEHKQNRADG 
TTIIWALAEM 
PLLVPRECME 
HHQFIPFGAG 
KDDLILIATP 



VFKKWKTRKL 
VISSPQMAKE 
AKMVKFFSSI 
IFMREIISLA 
KKGNGEFGGE 
MKKPSVLAKA 
DTKIDGYNIP 
RRICPGMLFG 
AHS 



NLPPGPWKLP 
VLKTHDLAFA 
RQDELSKIWS 
GGFSIADFFP 
DLIDVLLRVR 
QAEVSQALKG 
FKTRVIVNAW 
LANVGQPLAQ 



FIGSLHHLAV 
TRPKLWADI 
SIRTTPNLPV 
TWKMIHDIDG 
ESGEVQIPIT 
KKISFQEIDI 
AIGRDPQSWD 
LLYHFDWKIiP 



AGPLPHHGLK 
IHYDSTDIAL 
NLTDKIFWFT 
SKSKLVKAHR 
DDNIKSILID 
DKLKYLKLVI 
DPESFTPERF 
NGQTHQNFDM 
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NAME D163-AG12 
ORGANISM NICOTIANA 
SEQ. ID, NO. 199 

1 ATCCTTCTTC 
61 ATTCTTGCTC 
121 AAATTTGCCT 
181 GGCAGGTCCA 
241 GCATTTACAA 
301 AGTACTAAAA 
361 CATTCACTAC 
421 TCGTAAAATT 
481 TCGCCAAGAT 
541 CAATCTTACT 
601 GAAGATATGT 
661 AGGTGGATTT 
721 TTCGAAATCT 
7 81 TGATGAGCAC 
841 AGATTTGATT 
901 AAATGACAAT 
961 GACGACTATA 
1021 ACAAGCTGAA 
1081 TGAGCTAAAA 
1141 TCTATTAGTT 
1201 CAAAACAAGA 
1261 CCCCGAAAGC 
1321 TCATCAGTTT 
13 81 AGCTAATGTT 
1441 TGGACAAAGT 
1501 GGATGATCTT 
1561 AAGCCGGGGC 
1621 CAAAATTCAT 

SEQ. ID. NO. 200 

1 MEIQFSNIiVA 
61 NLAKRYGPLM 
121 SPYGEYWRQI 
181 SSVTCRSALG 

2 41 KIDEILGNW 
301 I^SAGSETSS 

3 61 ETLRl^PPIP 
421 NSSIDFLGNH 
481 ESPGISATRK 



TABACUM 



CTTCCTAGGT 
TTTCTCTCCA 
CCTGGTCCAT 
CTTCCTCACC 
CTTGGACAAA 
ACTCACGACC 
GACAGCACGG 
TGCATATTGG 
GAGCTCTCGA 
GACAAAATTT 
GGTGACCAAG 
AGTATTGCTG 
AAACTGGTGA 
AAAAAGAACA 
GATGTATTGT 
ATCAAATCAA 
ATTTGGGCAT 
GTAAGGCAAG 
TATCTCAAGT 
CCTAGAGAAT 
GTCATAGTTA 
TTTATGCCAG 
ATACCATTTG 
GGACAACCTT 
CATGAGAATT 
GTTTTGATTG 
AAACAGAAAA 
CCACAATATT 



CCTAACTAAA 
GCATCTTTCT 
GGAAATTACC 
ATGGCCTAAA 
TTCCTACACT 
TCGCTTTTGC 
ACATAGCATT 
A.2^CTCTTGAG 
AGATGCTCTC 
TTTGGTTTAC 
ACAAATTGAT 
ATTTTTTCCC 
AAGCACATCG 
GAGCAGATGG 
TAAGAGTTAG 
TATTAATCGA 
TAGCTGAAAT 
CTTTGAAGGA 
TAGTAATCAA 
GTATGGAGGA 
ATGCATGGGC 
AGAGATTTGA 
GTGCAGGAAG 
TAGCTCAGTT 
TCGACATGAC 
CCACTCCTTA 
AAGTATTGCT 
TAGTGTTTGC 



AATGGAGATT 
TCTATTCAAA 
TTTTATTGGA 
AAATTTAGCC 
CATCATATCA 
CACTAGACCA 
TTCTCCGTAC 
TGCCAAGATG 
ATCTATACGA 
GAGTTCGGTA 
CATTTTTATG 
TACATGGAAA 
TAAGATTGAT 
CAAG.AAGGGT 
AGAAAGTGGA 
CATGTTCTCT 
GATGAAGAAA 
GAAAAAAGGT 
AGAAACCTTA 
TACAAAGATT 
AATCGGACGA 
GAATAGTTCT 
AAGGATTTGT 
ACTTTATCAC 
TGAGTCACCT 
TGATTCTTAT 
GCTTCTAGGT 
TAGAGTTGGT 



CAGTTTTCTA 
AAATGGAAAA 
AGTTTACACC 
AAACGCTATG 
TCACCTCAAA 
AAGCTTGTCG 
GGTGAATACT 
GTCAAATTTT 
ACGACACCCA 
ACTTGTAGAT 
AGGGAAATAA 
ATGATTCATG 
GAAATTTTGG 
AATGGTGAAT 
GAAGTTCAAA 
GCGGGATCTG 
CCAAGTGTTT 
TTTCAACAGA 
AGAATGCACC 
GATGGTTACA 
GATCCAGAAA 
ATTGACTTTC 
CCGGGAATGC 
TTCGATTGGA 
GGAATTTCTG 
TAAGCAGTAG 
ATTTTCTATT 
TAGC 



ACTTAGTTGC 
CCAGAAAACT 
ATTTGGCTGT 
GTCCTCTTAT 
TGGCAAAAGA 
TGGCCGACAT 
GGAGACAAAT 
TTAGCTCGAT 
ATCTTACAGT 
CAGCTTTAGG 
TATCATTGGC 
ATATTGATGG 
GAAATGTTGT 
TTGGTGGTGA 
TTCCTATCAC 
AAACATCATC 
TAGCAAAGGC 
TTGATCTTGA 
CTCCAATTCC 
ATATACCTTT 
GTTGGGATGA 
TTGGAAATCA 
TATTTGGTTT 
AACTCCCTAA 
CTACAAGAAA 
CAGAAATAAA 
GGATAAATTT 



FLLFLSSIFL 
HLQLGQIPTL 
RKICILELLS 
KICGDQDKLI 
DEHKKNRADG 
TTIIWALiAEM 
LLVPRECMED 
HQFIPFGAGR 
DDLVLIATPY 



LFKKWKTRKIi 
IISSPQMAKE 
AKMVKFFSSI 
IFMREIISLA 
KKGNGEFGGE 
MKKPSVLAKA 
TKIDGYNIPF 
RICPGMLFGL 
DSY 



NLPPGPWKLP 
VLKTHDLAFA 
RQDELSKMLS 
GGFSIADFFP 
DLIDVLLRVR 
QAEVRQALKE 
KTRVIVNAWA 
ANVGQPLAQL 



FIGSLHHLAV 
TRPKLWADI 
SIRTTPNLTV 
TVn<MIHDIDG 
ESGEVQIPIT 
KKGFQQIDLD 
IGRDPESWDD 
LYHFDWKLPN 



AGPLPHHGLK 
IHYDSTDIAF 

NLTDKIFWFT 
SKSKLVKAHR 
NDNIKSILID 
ELKYLKLVIK 
PESFMPERFE 
GQSHENFDMT 
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NAME D205-BG9 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO, 201 

1 TTCTTATTTT GATTCAACCA TGGAGAACCA ATACTCCTAC TCATTCTCTT CCTACTTCTA 
61 CTTAGCTATA GTACTGTTTC TTCTTCCAAT TTTGGTCAAA TATTTCTTCC ATCGGAGAAG 
121 AAATTTACCT CCAAGTCCAT TTTCTCTTCC AATAATTGGT CACCTTTACC TTCTCAAGAA 
181 AACTCTCCAT CTCACTCTAA CATCCTTATC AGCTAAATAT GGTCCTGTTT TATACCTCAA 
241 ATTGGGCTCT ATGCCTGTGA TTGTTGTGTC CTCACCATCT GCTGTTGAAG AATGTTTAAC 
301 CAAGAATGAT ATCATATTCG CAAATAGGCC CAAGACCGTG GCTGGTGACA AGTTTACCTA 
361 CAATTATACT GTTTATGTTT GGGCACCCTA TGGCCAACTT TGGAGAATTC TTCGCCGATT 
421 AACTGTCGTT GAACTCTTCT CTTCACATAG CCTACAGAAA ACTTCTATCC TTAGAGATCA 
481 AGAAGTTGCA ATATTTATCC GTTCGTTATA CAAATTCTCA AAGGATAGTA GCAAAAAAGT 
541 CGATTTGACC AACTGGTCTT TTACTTTGGT TTTCAATCTT ATGACCAAAA TTATTGCTGG 
601 GAGACATATT GTGAAGGAGG AAGATGCTGG CAAGGAAAAG GGCATTGAAA TTATTGAAAA 
661 ACTTAGAGGG ACTTTCTTAG TAACTACATC ATTCTTGAAT ATGTGTGATT TCTTGCCAGT 
721 ATTCAGGTGG GTTGGTTACA AAGGGCTGGA GAAGAAGATG GCCTCAATTC ACAATAGAAG 
781 AAATGAATTC TTGAACAGCT TGCTTGATGA ATTTCGACAC AAGAAAAGTA GTGCTTCACA 
841 ATCTAACACA ACTGTTGGAA ACATGGAGAA GAAAACCACA CTGATTGAAA AGCTCTTGTC 
901 TCTTCAAGAA TCAGAGCCTG AATTCTACAC TGATGATATC ATCAAAAGTA TTATGCTGGT 
961 AGTTTTTGTT GCAGGAACAG AGACCTCATC AACAACCATC CAATGGGTAA TGAGGCTTCT 
1021 TGTAGCTCAC CCTGAGGCAT TGTATAAGCT ACGAGCTGAC ATTGACAGTA AAGTTGGGAA 
1081 TAAGCGCTTG CTGAATGAAT CAGACCTCAA CAAGCTTCCG TATTTGCATT GTGTTGTTAA 
1141 TGAGACAATG AGATTATACA CTCCGATACC ACTTTTATTG CCTCATTATT CAACTAAAGA 
1201 TTGTATTGTG GAAGGATATG ATGTACCAAA ACATACAATG TTGTTTGTCA ACGCTTGGGC 
1261 CATTCACAGG GATCCCAAGG TATGGGAGGA GCCTGACAAG TTCAAGCCAG AGAGATTTGA 
1321 GGCAACAGAA GGGGAAACAG AAAGGTTCAA TTACAAGCTT GTACCATTTG GAATGGGGAG 
1381 AAGAGCGTGC CCTGGAGCTG ATATGGGGTT GCGAGCAGTT TCTTTGGCAT TAGGTGCACT 
1441 TATTCAATGC TTTGACTGGC AAATTGAGGA AGCGGAAAGC TTGGAGGAAA GCTATAATTC 
1501 TAGAATGACT ATGCAGAACA AGCCTTTGAA GGTTGTCTGC ACTCCACGCG AAGATCTTGG 
1561 CCAGCTTCTA TCCCAACTCT AAGGCAATTT ATCAATGCCA AACGTAATCT TCATCTACCA 
1621 CTATG 

SEQ, ID. NO. 202 

1 MENQYSYSFS SYFYIiAIVLF LLPILVKYFF HRRRNLPPSP FSLPIIGHLY LLKKTLHLTL 

61 TSLSAKYGPV LYLKLGSMPV IWSSPSAVE ECLTKNDIIF ANRPKT\^AGD KFTYNYTVYV 

121 WAPYGQLWRI LRRLTWELF SSHSLQKTSI LRDQEVAIFI RSLYKFSKDS SKKVDLTNWS 

181 FTLVFNIil^TK IIAGRHIVKE EDAGKEKGIE IIEKLRGTFL VTTSFLN14CD FLPVFRWVGY 

241 KGLEKKMASI HNRRNEFLNS LLDEFRHKKS SASQSNTTVG NMEKKTTLIE KLLSLQESEP 

301 EFYTDDIIKS IMLWFVAGT ETSSTTIQWV MRLLVAHPEA LYKLRADIDS KVGNKRLLNE 

361 SDLNKLPYLH CWNETHRLY TPIPLLLPHY STKDCIVEGY DVPKHTMLFV NAWAIHRDPK 

421 VWEEPDKFKP ERFEATEGET ERFNYKLVPF GMGRRACPGA DMGLRAVSLA LGALIQCFDW 

481 QIEEAESLEE SYNSRMTMQN KPLKWCTPR EDLGQLLSQL 
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NAME D207-AA5 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 203 

1 AACCAACCTT CCTTTTCTTA CTTAGTAAAA 
61 TTGCTTTGCT ACTCTTCATT TCATTTCTTT 
121 TCCCAAAGTT ACCTCCAGGT CCATGGAGAC 
181 AAGGTAAACT CCCACACCAT CATCTTAGAG 
241 ATTTACAACT TGGAGAAGTT CCTGTAGTTG 
301 TACTAAAAAC TCATGATCTT GCTTTTGCAA 
361 TGTTTTACAA AAGCAGGGAC ATATCATTCG 
421 GTAAAATATT AACACAAGAA CTCTTGAGTA 
481 GAAAGGATGA GCTCTCGAAG CTCCTCTCGT 
541 TGAACATAAA CGAAAAGCTT CTCTGGTTTA 
601 GAAAAATATG CAACGATCGT GATGAATTGA 
661 CAGGAGGATT TGATGTGTGT GATTTGTTCC 
721 ACATGAAAGC TAGATTGACG AATGTTCACC 
781 TCAATGAGCA CAAAGAGAAT CATGCAGCAG 
841 AAGATATGAT TGATGCTTTA CTGAGGGTTA 
901 AAAATGACAA CATGAAAGCA GTAATTCTGG 
961 ATACTGCAAT TATATGGGCA CTATCAGAAT 
1021 CACAAGCTGA AGTGAGAAAA GTCTTCAAAG 
1081 ACAAGTTGCC ATACTTAAAA TCAGTGATCA 
1141 CTTTATTAGG ACCTAGAGAA TGCAGAGAAC 

12 01 TTAAAGCTAG AGTAATGGTT AATGCATGGG 
1261 ATCCTGAAAG TTTCAAACCC GAGCGATTTG 
1321 ACTATCAGTT CATCCCTTTC GGTTCAGGAA 

13 81 TAGTTAACAC TGGGCATCCT TTAGCTCAGT 
1441 ATAAGGTTAA TGCAGCTGAT TTTCACACTA 
1501 AAGATGACCT CTACTTGATT CCAACAAATC 
1561 TCTTGTCTTG GAACAATAAA AGAAGAAACT 
1621 TTATATTAGT ATGGGTGTGT TCAGTTTCTT 
1681 CTATATAAAC CAGTGAGACT TTTTATTGGT 
1741 CATATTTTAT TCAAAAAAAA AAAAAAA 



TGGATATTCA GTCTTCTCCT TTCAACTTAA 
TTATCCTATT GAAAAAGTGG AATACCAAAA 
TTCCCCTTAT TGGCAGCCTC CATCACTTGA 
ATTTAGCCCG AAAATATGGA CCTCTCATGT 
TAATATCTTC GCCACGTATA GCAAAAGCTG 
CGAGGCCTCG GTTCATGTCC TCGGACATTG 
CCCCATATGG CGATTACTGG AGACAAATGC 
ACAAGATGCT CAAGTCATTT AGCACAATCC 
CGATTCGTTT AGCAACAGCT TCTTCTGCAG 
CAAGTTGCAT GACTTGTAGA TTAGCCTTTG 
TTATGTTAAT AAGGGAGATA TTAGCATTAT 
CTTCATGGAA ATTACTTCAC AATATGAGCA 
ATAAGTATAA TCTAATTATG GAGAATATCA 
GGATAAAGGG AAATAACGAG TTTGGTGGCG 
AGGAGAATAA TGAGCTTCAA TTTCCTATCG 
ACTTGTTTAT TGCTGGAACT GAAACTTCAT 
TGATGAAGCA CCCAAGTGTT ATGGCCAAGG 
AAAATGAAAA CTTGGACGAA AATGATCTTG 
AAGAAACACT AAGGATGCAT CCTCCAGTTC 
AAACTGAGAT TGATGGATAT ACTGTACCTC 
CAATTGGAAG AGATCCTGAA AGriGGGAAG 
AAAATATTTC TGTTGATCTT ACGGGAAATC 
GAAGAATGTG TCCAGGAATG TCGTTTGGTT 
TGCTCTATTT CTTTGACTGG AAATTCCCTC 
CTGAAACAAG TAGAGTTTTT GCAGCAAGCA 
ACATGGAGCA AGAGTAGCTC TAAATTGAAT 
CCAGCTTGGT CTACATTATT TCTTTTTGCT 
ATTTTTAAGG GTACCCTGAA AGATAAAGGG 
TGCAAGGTTT TAGATCAAGC CATAAGACAG 



SEQ. ID. NO, 204 

1 MDIQSSPFNL lALLLFISFL FILLKKWNTK 

61 DLARKYGPLM YLQLGEVPW VISSPRIAKA 

121 APYGDYWRQM RKILTQELLS NKMLKSFSTI 

181 TSCMTCRIiAF GKICNDRDEL IMLIREILAL 

241 HKYNLIMENI INEHKEMHAA GIKGNNEFGG 

301 DLFIAGTETS YTAIIWALSE LMKHPSVMAK 

361 KETLRMHPPV PLLGPRECRE QTEIDGYTVP 

421 ENISVDLTGN HYQFIPFGSG RR^^CPGMSFG 

481 TETSRVFAAS KDDLYLIPTN H14EQE 



IPKLPPGPWR LPLIGSLHHL KGKLPHHHIiR 
VLKTHDLAFA TRPRFMSSDI VFYKSRDISF 
RKDELSKLLS SIRLATASSA VNIl^KLLWF 
SGGFDVCDLF PSWKLLHNMS rjMKARLTOVH 
EDMIDALLRV KENNELQFPI ENDNMKAVIL 
AQAEVRKVFK ENENLDENDL DKLPYbKSVI 
LKARVMVNAW AIGRDPESWE DPESFKPERF 
LVNTGHPLAQ LLYFFDWKFP HKVNAADFHT 
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NAME D207-AB4 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 205 

1 AACCAACCTT CCTTTTCTTA CTTAGTAAAA TGGATATTCA GTCTTCTCCT TTCAACTTAA 
61 TTGCTTTGCT ACTCTTCATT TCATTTCTTT TTATCCTATT GAAAAAGTGG AATACCAAAA 
121 TCCCAAAGTT ACCTCCAGGT CCATGGAGAC TTCCCCTTAT TGGCAGCCTC CATCACTTGA 
181 AAGGTAAACT CCCACACCAT CATCTTAGAG ATTTAGCCCG AAAATATGGA CCTCTCATGT 
241 ATTTACAACT TGGAGAAGTT CCTGTAGTTG TAATATCTTC GCCACGTATA GCAAAAGCTG 
301 TACTAAAAAC TCATGATCTT GCTTTTGCAA CGAGGCCTCG GTTCATGTCC TCGGACATTG 
361 TGTTTTACAA AAGCAGGGAC ATATCATTCG CCCCATATGG CGATTACTGG AGACAAATGC 
421 GTAAAATATT AACACAAGAA CTCTTGAGTA ACAAGATGCT CAAGTCATTT AGCACAATCC 
481 GAAAGGATGG GCTCTCGAAG CTCCTCTCGT CGATTCGTTT AGCAACAGCT TCTTCTGCAG 
541 TGAACATAAA CGAAAAGCTT CTCTGGTTTA CAAGTTGCAT GACTTGTAGA TTAGCCTTTG 
601 GAAAAATATG CAACGATCGT GATGAATTGA TTATGTTAAT AAGGGAGATA TTAGCATTAT 
661 CAGGAGGATT TGATGTGTGT GATTTGTTCC CTTCATGGAA ATTACTTCAC AATATGAGCA 
721 ACATGAAAGC TAGATTGACG AATGTTCACC ATAAGTATAA TCTAATTATG GAGAATATCA 
781 TCAATGAGCA CAAAGAGAAT CATGCAGCAG GGATAAAGGG AAATAACGAG TTTGGTGGCG 
841 AAGATATGAT TGATGCTTTA CTGAGGGTTA AGGAGAATAA TGAGCTTCAA TTTCCTATCG 
901 AAAATGACAA CATGAAAGCA GTAATTCTGG ACTTGTTTAT TGCTGGAACT GAAACTTCAT 
961 ATACTGCAAT TATATGGGCA CTATCAGAAT TGATGAAGCA CCCAAGTGTT ATGGCCAAGG 
1021 CACAAGCTGA AGTGAGAAAA GTCTTCAAAG AAAATGAAAA CTTGGACGAA AATGATCTTG 
1081 ACAAGTTGCC ATACTTAAAA TCAGTGATCA AAGAAACACT AAGGATGCAT CCTCCAGTTC 
1141 CTTTATTAGG ACCTAGAGAA TGCAGAGAAC AAACTGAGAT TGATGGATAT ACTGTACCTC 

12 01 TTAAAGCTAG AGTAATGGTT AATGCATGGG CAATTGGAAG AGATCCTGAA AGTTGGGAAG 
1261 ATCCTGAAAG TTTCAAACCC GAGCGATTTG AAAATATTTC TGTTGATCTT ACGGGAAATC 
1321 ACTATCAGTT CATTCCTTTC GGTTCAGGAA GAAGAATGTG TCCAGGAATG TCGTTTGGTT 

13 81 TAGTTAACAC TGGGCATCCT TTAGCTCAGT TGCTCTATTT CTTTGACTGG AAATTCCCTC 
1441 ATAAGGTTAA TGCAGCTGAT TTTCACACTA CTGAAACAAG TAGAGTTTTT GCAGCAAGCA 
1501 AAGATGACCT CTACTTGATT CCAACAAATC ACATGGAGCA AGAGTAGCTC TAAATTGAAT 
1561 TCTTGTCTTG GAACGATAAA AGAAGAAACT CCAGCTTGGT CTACATTATT TCTTTTTGCT 
1621 TTATATTAGT ATGGGTGTGT TCAGTTTCTT GTTTTTAAGG GTACCCTGAA AGATAAAGGG 
1681 CTATATAAAC CAGTGAGACT TTTTATTGAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

SEQ. ID. NO. 206 

1 l^IQSSPFiqTL lALLLFISFL FILLKKWNTK IPKLPPGPWR LPLIGSLHHL KGKLPHHHLR 
61 DLARKYGPLM YLQLGEVPW VISSPRIAKA VLKTHDIiAFA TRPRFMSSDI VFYKSRDISF 
121 APYGDYWRQM RKILTQELLS NKMLKSFSTI RKDELSKLLS SIRLATASSA VNINEKLLWF 
181 TSCMTCRLAF GKICNDRDEL IMLIREILAL SGGFDVCDLF PSWKLLHNMS NMKARLTNVH 
241 HKYNLIMENI INEHKENHAA GIKGNNEFGG EDMIDALLRV KENNELQFPI ENDNMKAVIL 
301 DLFIAGTETS YTAIIWALSE LMKHPSVMAK AQAEVRKVFK ENENLDENDL DKLPYLKSVI 
3 61 KETLRMHPPV PLLGPRECRE QTEIDGYTVP LKARVimJAW AIGRDPESWE DPESFKPERF 
421 ENISVDLTGN HYQFIPFGSG RRMCPGMSFG LVNTGHPLAQ LLYLFDWKFP HKVNAADFHT 
481 TETSRVFAAS KDDLYLIPTN HMEQE 
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NAME D207-AC4 
ORGATUSM NICOTIANA TABACUM 
SEQ. ID. NO. 207 

1 AACCAACCTT CCTTTTCTTA CTTAGTAAAA TGGATATTCA GTCTTCTCCT TTCAACTTAA 
61 TTGCTTTGCT ACTCTTCATT TCATTTCTTT TTATCCTATT GAAAAAGTGG AATACCAAAA 
121 TCCCAAAGTT ACCTCCAGGT CCATGGAGAC TTCCCCTTAT TGGCAGCCTC CATCACTTGA 
181 AAGGTAAACT CCCACACCAT CATCTTAGAG ATTTAGCCCG AAAATATGGA CCTCTCATGT 
241 ATTTACAACT TGGAGAAGTT CCTGTAGTTG TAATATCTTC GCCACGTATA GCAAAAGCTG 
3 01 TACTAAAAAC TCATGATCTT GCTTTTGCAA CGAGGCCTCG GTTCATGTCC TCGGACATTG 
361 TGTTTTACAA AAGCAGGGAC ATATCATTCG CCCCATATGG CGATTACTGG AGACAAATGC 
421 GTAAAATATT AACACAAGAA CTCTTGAGTA ACAAGATGCT CAAGTCATTT AGCACAATCC 
481 GAAAGGATGA GCTCTCGAAG CTCCTCTCGT CGATTCGTTT AGCAACAGCT TCTTCTGCAG 
541 TGAACATAAA CGAAAAGCTT CTCTGGTTTA CAAGTTGCAT GACTTGTAGA TTAGCCTTTG 
601 GAAAAATATG CAACGATCGT GATGAATTGA TTATGTTAAT AAGGGAGATA TTAGCATTAT 
661 CAGGAGGATT TGATGTGTGT GATTTGTTCC CTTCATGGAA ATTACTTCAC AATATGAGCA 
721 ACATGAAAGC TAGATTGACG AATGTTCACC ATAAGTATAA TCTAATTATG GAGAATATCA 

7 81 TCAATGAGCA CAAAGAGAAT CATGCAGCAG GGATAAAGGG AAATAACGAG TTTGGTGGCG 

8 41 AAGATATGAT TGATGCTTTA CTGAGGGTTA AGGAGAATAA TGAGCTTCAA TTTCCTATCG 
901 AAAATGACAA CATGAAAGCA GTAATTCTGG ACTTGTTTAT TGCTGGAACT GAAACTTCAT 
961 ATACTGCAAT TATATGGGCA CTATCAGAAT TGATGAAGCA CCCAAGTGTT ATGGCCAAGG 

1021 CACAAGCTGA AGTGAGAAAA GTCTTCAAAG AAAATGAAAA CTTGGACGAA AATGATCTTG 
1081 ACAAGTTGCC ATACTTAAAA TCAGTGATCA AAGAAACACT AAGGATGCAT CCTCCAGTTC 
1141 CTTTATTAGG ACCTAGAGAA TGCAGAGAAC AAACTGAGAT TGATGGATAT ACTGTACCTC 
12 01 TTAAAGCTAG AGTAATGGTT AATGCATGGG CAATTGGAAG AGATCCTGAA AGTTGGGAAG 
12 61 ATCCTGAAAG TTTCAAACCC GAGCGATTTG AAAATATTTC TGTTGATCTT ACGGa\AATC 
1321 ACTATCAGTT CATTCCTTTC GGTTCAGGAA GAAGAATGTG TCCAGGAATG TCGTTTGGTT 
1381 TAGTTAACAC TGGGCATCCT TTAGCTCAGT TGCTCTATCT CTTTGACTGG AAATTCCCTC 
1441 ATAAGGTTAA TGCAGCTGAT TTTCACACTA CTGAAACAAG TAGAGTTTTT GCAGCAAGCA 
1501 AAGATGACCT CTACTTGATT CCAACAAATC ACATGGAGCA AGAGTAGCTC TAAATTGAAT 
1561 TCTTGTCTTG GAACAATAAA AGAAGAAACT CCAGCTTGGT CTACATTATT TCCTTTTGCT 
1621 TTATATTAGT ATGGGTGTGT TCAGTCTCTT GTTTTTAAGG GTACCCTGAA AGATAAAGGG 
1681 CTATATAAAC CAGTGAGACT TTTTATTGGT TGCAAGGTTT TAGATCAAGC CATAAGACAG 
1741 CATATTTTAT TCCACCATTT TCTATCATGT TTAATAAAGT TCCTTTCGTT TATTGTTAGA 
1801 AAAAAAAAAA AAAAAAAAAA AAA 



SEQ. ID. NO. 208 

1 l^IQSSPFNIi lALLLFISFL FILLKKWNTK IPKLPPGPWR LPLIGSLHHL KGKLPHHHLR 

61 DLARKYGPLM YLQLGEVPW VISSPRIAKA VLKTHDLAFA TRPRFMSSDI VFYKSRDISF 

121 APYGDYWRQM RKILTQELLS I^TKMLKSFSTI RKDELSKLLS SIRLATASSA VNINEKLLWF 

181 TSCMTCRLAF GKICNDRDEL IMLIREILAL SGGFDVCDLF PSWKLLHNMS NMKARLTNVH 

241 HKYNLIMENI INEHKENHAA GIKGNNEFGG EDMIDALLRV KENNELQFPI ENDNMKAVIL 

301 DLFIAGTETS YTAIIWALSE LMKHPSVMAK AQAEVRKVFK ENENLDENDL DKLPYLKSVT 

3 61 KETLRldHPPV PLLGPRECRE QTEIDGYTVP LKARVMVNAW AIGRDPESWE DPESFKPERF 

421 ENISVDLTGN HYQFIPFGSG RRMCPGMSFG LVNTGHPLAQ LLYLFDWKFP HKVl^AADFHT 

481 TETSRVFAAS KDDLYLIPTN HMEQE 
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NAME D209-AA10 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 209 

1 ATATGCAACT GAGATTTGAA GAATACCAAC TAACCAAAAT GCAGTTCTTC AGCCTGGTTT 
61 CCATTTTCCT ATTTCTATCT TTCCTCTTTT TGTTAAGGGT ATGGAAGAAC TCCAATAGCC 
121 AAAGCAAAAA GTTGCCACCA GGTCCATGGA AACTACCAAT ACTAGGAAGT ATGCTTCATA 
181 TGGTTGGTGG ACTACCACAC CATGTCCTTA GAGATTTAGC CAAAAAATAT GGACCACTTA 
241 TGCACCTTCA ATTAGGTGAA GTTTCTGCGG TTGTGGTTAC TTCTCCTGAT ACGGCAAAAG 
301 AAGTATTAAA AACTCATGAC ATCGCTTTTG CGTCTAGGCC TAGCCTTTTG GCCCCGGAQA 
361 TTGTCTGTTA CAATAGGTCT GATCTAGCCT TTTGCCCCTA TGGCGACTAT TGGAGACAAA 
421 TGCGTAAAAT ATGTGTCTTG GAAGTGCTCA GTGCCAAGAA TGTTCGGACA TTTAGCTCTA 
4 81 TTAGGCGGAA TGAAGTTCTT CGTCTCATTA ATTTTATCCG GTCATCTTCT GGTGAACCTA 
541 TTAATGTTAC GGAAAGGATC TTTTTGTTCA CAAGCTCCAT GACATGTAGA TCAGCGTTTG 
601 GGCAAGTGTT CAAAGAGCAA GACAAATTTA TACAACTAAT TAAAGAAGTG ATACTCTTAG 
661 CAGGAGGGTT TGATGTGGCT GACATATTCC CTTCACTGAA GTTTCTTCAT GTGCTCAGTG 
721 GAATGAAGGG TAAGATTATG AATGCACACC ATAAGGTAGA TGCCATTGTT GAGAATGTCA 
781 TCAATGAGCA CAAGAAAAAT CTTGCAATTG GGAAAACTAA TGGAGCGTTA GGAGGTGAAG 
841 ATTTAATTGA TGTTCTTCTA AGACTTATGA ATGATGGAGG CCTTCAATTT CCTATCACCA 
901 ACGACAACAT CAAAGCTATA ATTTTTGACA TGTTTGCTGC CGGGACAGAG ACTTCATCGT 
961 CAACAATTGT GTGGGCTATG GTAGAAATGG TGAAAAATCC AGCCGTATTC GCGAAAGCTC 
1021 AAGCAGAAGT AAGAGAAGCA TTTAGAGGAA AAGAAACTTT CGATGAAAAT GATGTGGAGG 
1081 AGCTAAACTA CCTAAAGTTA GTAATAAAAG AAACTCTAAG ACTTCATCCA CCGGTTCCAC 
1141 TTTTGCTCCC AAGAGAATGT AGGGAAGAGA CAAATATAAA CGGCTACACT ATTCCTGTAA 
1201 AGACCAAAGT CATGGTTAAT GTTTGGGCTT TGGGA;^.GAGA TCCAAAATAT TGGAATGACG 
1261 CAGAAACTTT TATGCCAGAG AGATTTGAGC agtgctctaa ggattttgtt ggtaataatt 
1321 ttgaatatct tccatttggt ggcggaagga ggatttgtcc tgggatttcg tttggcttag 
1381 ctaatgctta tttgccattg gctcaattac tatatcactt cg attggaaa ctccctgctg 
1441 gaatcgaacc aagcgacttg gacttgactg agttggttgg agtaactgcc gctagaaaaa 
1501 gtgaccttta cttggttgcg actccttatc aacctcctca aaagtgattt aatggtttca 
1561 agtttttatt tcctagcaaa ccccactatt gtcctatctt tcttttggtg ttttcggttt 
1621 tatctactct aatacatgca tcttttacca tataggaatg taccatgttg tog 

SEQ. ID. NO. 210 

1 MQLRFEEYQL TKMQFFSLVS IFLFLSFLFL LRVWKNSNSQ SKKLPPGPWK LPILGSMLHM 

61 VGGLPHHVLR DLAKKYGPLM HLQLGEVSAV WTSPDTAKE VLKTHDIAFA SRPSLLAPEI 

121 VCYNRSDLAF CPYGDYVTRQM RKXCVLEVLS AKNVRTFSSI RRNEVLRLIN FIRSSSGEPI 

181 NVTERIFLFT SSMTCRSAFG QVFKEQDKFI QLIKEVILLA GGFDVADIFP SLKFLHVLSG 

241 MKGKIMNAHH KVDAIVENVI IIEHKKNLAIG KTNGALGGED LIDVLLRLMN DGGLQFPITN 

301 DNIKAIIFDM F7KAGTETSSS TIVWAMVEMV KNPAVFAKAQ AEVREAFRGK ETFDENDVEE 

361 LNYLKLVIKE TLRLHPPVPL LLPRECREET NINGYTIPVK TKVMVNWJAL GRDPK^/WNDA 

421 ETFMPERFEQ CSKDFVGNNF EYLPFGGGRR ICPGISFGLA NAYLPLAQLL YHFDWKLPAG 

481 lEPSDLDLTE LVGVTAARKS DLYLVATPYQ PPQK 
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NAME D209-AA12 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO, 211 

1 ATATGCAACT GAGATTTGAA GAATACCAAC TAACCAAAAT GCAGTTCTTC AGCTTGGTTT 
61 CCATTTTCCT ATTTCTATCT TTCCTCTTTT TGTTAAGGAT ATGGAAGAAC TCCAATAGCC 
121 AAAGCAAAAA GTTGCCACCA GGTCCATGGA AACTACCAAT ACTAGGAAGT ATGCTTCATA 
181 TGGTTGGTGG ACTACCACAC CATGTCCTTA GAGATTTAGC CAAAAAATAT GGACCACTTA 
241 TGCACCTTCA ATTAGGTGAA GTTTCTGCGG TTGTGGTTAC TTCTCCTGAT ACGGCAAAAG 

3 01 AAGTATTAAA AACTCATGAC ATCGCTTTTG CGTCTAGGCC TAGCCTTTTG GCCCCGGAGA 
361 TTGTCTGTTA CAATAGGTCT GATCTAGCCT TTTGCCCCTA TGGCGACTAT TGGAGACAAA 
421 TGCGTAAAAT ATGTGTCTTG GAAGTGCTCA GTGCCAAGAA TGTTCGGACA TTTAGCTCTA 

4 81 TTAGGCGGAA TGAAGTTCTT CGTCTCATTA ATTTTATCCG GTCATCTTCT GGTGAACCTA 
541 TTAATGTTAC GGAAAGGATC TTTTTGTTCA CAAGCTCCAT GACATGTAGA TCAGCGTTTG 
601 GGC AAGTGTT CAAAGAGCAA GACAAATTTA TACAACTAAT TAAAGAAGTG ATACTCTTAG 
661 CAGGAGGGTT TGATGTGGCT GACATATTCC CTTCACTGAA GTTTCTTCAT GTGCTCAGTG 
721 GAATGAAGGG TAAGATTATG AATGCACACC ATAAGGTAGA TGCCATTGTT GAGAATGTCA 
7 81 TCAATGAGCA CAAGAAAAAT CTTGCAATTG GGAAAACTAA TGGAGCGTTA GGAGGTGAAG 
841 ATTTAATTGA TGTTCTTCTA AGACTTATGA ATGATGGAGG CCTTCAATTT CCTATCACCA 
901 ACGACAACAT CAAAGCCATA ATTTTTa^^CA TGTTTGCTGC CGGGACAGAG ACTTCATCGT 
961 CAACAATTGT GTGGGCTATG GTAGAAATGG TGAAAAATCC AGCCGTATTC GCGAAAGCTC 

1021 AAGCAGAAGT AAGAGAAGCA TTTAGAGGAA AAGAAACTTT CGATGAAAAT GATGTGGAGG 
1081 AGCTAAACTA CCTAAAGTTA GTAATAAAAG AAACTCTAAG ACTTCATCCA CCGGTTCCAC 
1141 TTTTGCTCCC AAGAGAATGT AGGGAAGAGA CAAATATAAA CGGCTACACT ATTCCTGTAA 
1201 AGACCAAAGT CATGGTTAAT GTTTGGGCTT TGGGAAGAGA TCCAAAATAT TGGAATGACG 

12 61 CAGAAACTTT TATGCCAGAG AGATTTGAGC AGTGCTCTAA GGATTTTGTT GGTAATAATT 
1321 TTGAATATCT TCCATTTGGT GGCGGAAGGA GGATTTGTCC TGGGATTTCG TTTGGCTTAG 

13 81 CTAATGCTTA TTTGCCATTG GCTCAATTAC TATATCACTT CGATTGGAAA CTCCCTGCTG 
1441 GAATCGAACC AAGCGACTTG GACTTGACTG AGTTGGTTGG AGTAACTGCC GCTAGAAAAA 
15 01 GTGACCTTTA CTTGGTTGCG ACTCCTTATC AACCTCCTCA AAAGTGATTT AATGGTTTCA 
1561 AGTTTTTATT TCCTAGCAAA CCCCACTATT GTCCTATCTT TCTTTTGGTG TTTTCGGTTT 
1621 TATCTACTCT AATACATGCA TCTTTTACCA TATAGGAATG TACCATGTTG TCG 

SEQ. ID. NO. 212 

1 MQLRFEEYQL TKMQFFSLVS IFLFLSFLFL LRIWKNSNSQ SKKLPPGPWK LPILGSMLHM 
61 VGGLPHHVLR DLAKKYGPLM HLQLGEVSAV WTSPDTAKE VLKTHDIAFA SRPSLLAPEI 
121 VCYNRSDLAF CPYGDYWRQM RKICVLEVLS AKNVRTFSSI RRNEVLRLIN FIRSSSGEPI 
181 NVTERIFLFT SSMTCRSAFG QVFKEQDKFI QLIKEVILLA GGFDVADIFP SLKFLHVLSG 

2 41 MKGKIMNAHH KVDAIVENVI ITEHKKNLAIG KTNGALGGED LIDVLLRLMN DGGLQFPITN 

3 01 DNIKAIIFDM FAAGTETSSS TIVWAMVEMV KNPAVFAKAQ AEVREAFRGK ETFDENDVEE 
3 61 LNYIiKLVIKE TLRLHPPVPL LLPRECREET NINGYTIPVK TKVl'lVNVWAL GRDPKYWNDA 
421 ETFMPERFEQ CSKDFVGNNF EYLPFGGGRR ICPGISFGLA NAYLPLAQLL TOFDWKLPAG 
481 lEPSDLDLTE LVGVTAARKS DLYLVATPYQ PPQK 
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NAME D209-AH10 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 213 

1 ATATGCAACT GAGATTTGAA GAATACCAAC TAACCAAAGT GCAGTTCTTC AGCTTGGTTT 
61 CCATTTTCCT ATTTCTATCT TTCCTCTTTT TGTTAAGGAT ATGGAAGAAC TCCAATAGCC 
121 AAAGCAAAAA GTTGCCACCA GGTCCATGGA AACTACCAAT ACTAGGAAGT ATGCTTCATA 
181 TGGTTGGTGG ACTACCACAC CATGTCCTTA GAGATTTAGC CAAAAAATAT GGACCACTTA 
241 TGCACCTTCA ATTAGGTGAA GTTTCTGCGG TTGTGGTTAC TTCTCCTGAT ACGGC AAAAG 
3 01 AAGTATTAAA AACTCATGAC ATCGCTTTTG CGTCTAGGCC TAGCCTTTTG GCCCCGGkGA 
3 61 TTGTCTGTTA CAATAGGTCT GATCTAGCCT TTTGCCCCTA TGGCGACTAT TGGAGACAAA 
421 TGCGTAAAAT ATGTGTCTTG GAAGTGCTCA GTGCCAAGAA TGTTCGGACA TTTAGCTCTA 
481 TTAGGCGGAA TGAAGTTCTT CGTCTCATTA ATTTTATCCG GTCATCTTCT GGTGAACCTA 
541 TTAATGTTAC GGAAAGGATC TTTTTGTTCA CAAGCTCCAT GACATGTAGA TCAGCGTTTG 
601 GGCAAGTGTT CAAAGAGCAA GACAAATTTA TACAACTAAT TAAAGAAGTG ATACTCTTAG 
661 CAGGAGGGTT TGATGTGGCT GACATATTCC CTTCACTGAA GTTTCTTCAT GTGCTCAGTG 
7 21 GAATGAAGGG TAAGATTATG AATGCACACC ATAAGGTAGA TGCCATTGTT GAGAATGTCA 
7 81 TCAATGAGCA CAAGAAAAAT CTTGCAATTG GGAAAACTAA TGGAGCGTTA GGAGGTGAAG 
841 ATTTAATTGA TGTTCCTCTA AGACTTATGA ATGATGGAGG CCTTCAATTT CCTATCACCA 
901 ACGACAACAT CAAAGCTATA ATTTTTGACA TGTTTGCTGC CGGGACGGAG ACTTCATCGT 
961 CAACAATTGT GTGGGCTATG GTAGAAATGG TGAAAAATCC AGCCGTATTC GCGAAAGCTC 
1021 AAGCAGAAGT AAGAGAAGCA TTTAGAGGAA AAGAAACTTT CGATGAAAAT GATGTGGAGG 
1081 AGCTAAACTA CCTAAAGTTA GTAATAAAAG AAACTCTAAG ACTTCATCCA CCGGTTCCAC 
1141 TTTTGCTCCC AAGAGAATGT AGGGAAGAGA CAAATATAAA CGGCTACACT ATTCCTGTAA 
12 01 AGACCAAAGT CATGGTTAAT GTTTGGGCTT TGGGAAGAGA TCCAAAATAT TGGAATGACG 

12 61 CAGAAACTTT TATGCCAGAG AGATTTGAGC AGTGCTCTAA GGATTTTGTT GGTAATAATT 
1321 TTGAATATCT TCCATTTGGT GGCGGAAGGA GGATTTGTCC TGGGATTTCG TTTGGCTTAG 

13 81 CTAATGCTTA TTTGCCATTG GCTCAATTAC TATATCACTT CGATTGGAAA CTCCCTGCTG 
1441 GAATCGAACC AAGCGACTTG GACTTGACTG AGTTGGTTGG AGTAACTGCC GCTAGAAAAA 
1501 GTGACCTTTA CTTGGTTGCG ACTCCTTATC AACCTCCTCA AAAGTGATTT AATGGTTTCA 
1561 AGTTTTTATT TCCTAGCAAA CCCCACTATT GTCCTATCTT TCTTTTGGTG TTTTCGGTTT 
1621 TATCTACTCT AATACATGCA TCTTTTACCA TATAGGAATG TACCATGTTG TCG 

SEQ. ID. NO. 214 

1 MQLRFEEYQL TKVQFFSLVS IFLFLSFLFL LRIWKNSNSQ SKKLPPGPWK LPILGSMLHM 
61 VGGLPHHVLR DLAKKYGPLM HLQLGEVSAV WTSPDTAKE VLKTHDIAFA SRPSLLAPEI 
121 VCYNRSDLAF CPYGDYVrRQM RKICVLEVLS AKNVRTFSSI RRl^TEVLRLIN FIRSSSGEPI 
181 NVTERIFLFT SSMTCRSAFG QVFKEQDKFI QLIKEVILLA GGFDVADIFP SLKFLHVLSG 
241 MKGKIMNAHH KVDAIVENVI ITEHKKNLAIG KTNGALGGED LIDVPLRLMN DGGLQFPITN 
301 DNIKAIIFDM FAAGTETSSS TIVWAMVEMV KIJPAVFAKAQ AEVREAFRGK ETFDENDVEE 
361 LNYLKLVIKE TLRLHPPVPL LLPRECREET NINGYTIPVK TKVMVNVWAL GRDPKYWNDA 
421 ETFMPERFEQ CSKDFVGNNF EYLPFGGGRR ICPGISFGLA NAYLPLAQLL YHFDWKLPAG 
481 lEPSDLDLTE LVGVTAARKS DLYLVATPYQ PPQK 
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NAME D87A-AF3 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 215 

1 GAAATGGGAA ATGCTCACAA CAGCAAAATT GCAGCAATCT GTTTGATAAT TTTCTTGGTA 
61 TATAAAGCAT GGGAATTGTT GAAGTGGATA TGGATTAAGC CAAAGAAACT GGAGAGTTGC 
121 CTCAGAAAAC AGGGACTCAA AGGAAATTCC TACAGGCTAT TCTATGGAGA TATGAAAGAA 
181 TTGTCCAAAA GTCTCAAGGA AATCAATTCA AAGCCCATCA TCAATCTATC AAATGAAGTA 

2 41 GCCCCAAGAA TCATTCCTTA TTATCTTGAA ATCATCCAAA AATATGGTAA AAGATGTTTT 

3 01 GTTTGGCAAG GACCAACCCC CGC AATATTA ATAACAGAGC CAGAATTAAT AAAGGAGATA 
361 TTTGGTAAGA ACTATGTTTT TCAGAAGCCT AATAATCCCA ACCCACTGAC CAAGTTATTG 
421 GCTCGAGGTG TTGTAAGCTA CGAGGAAGAA AAATGGGCAA AACACAGAAA GATCTTAAAC 
481 CCTGCCTTTC ATATGGAGAA GTTGAAGCAT ATGCTACCAG CATTTTACTT GAGCTGTAGT 
541 GAGATGCTGA ACAAATGGGA GGAGATTATC CCAGTAAAAG AATCAAATGA GTTGGACATT 
601 TGGCCTCATC TTCAAAGAAT GACAAGTGAT GTGATTTCTC GTGCTGCCTT TGGTAGTAGC 
661 TACGAAGAAG GAAGAAGAAT ATTTGAACTT CAAGAAGAAC AAGCTGAGTA TCTAACGAAG 
721 ACATTCAATT CAGTTTATAT CCCAGGTTCC AGATTTTTTC CCAATAAAAT GAACAAAAGA 
7 81 ATGAAAGAAT GTGAAAAGGA AGTACGAGAA ACAATTACGT GTCTAATTGA CAACAGATTA 
841 AAGGCAAAAG AAGAAGGCAA TGGCAAGGCC CTCAATGATG ACCTACTGGG TATATTATTA 
901 GAGTC.\AATT CTATAGAAAT TGAAGAACAT GGTAACAAGA AGTTTGGAAT GAGTATACCT 
961 GAAGTAATTG AAGAGTGCAA ATTATTCTAT TTTGCTGGCC AAGAGACTAC ATCAGTATTG 

1021 CTTGTGTGGA CACTGATTTT GTTAGGGAGA AATCCAGAAT GGCAGGAACG TGCTAGAGAG 
1081 GAAGTTTTTC AAGCCTTTGG AAGTGATAAA CCAACTTTTG ACGAATTATA TCGCTTGAAA 
1141 ATTGTGACGA TGATTTTGTA CGAGTCTTTA AGGTTATATC CACCAATAGC AACTCGTACT 
1201 CGAAGGACTA ATGAAGAAAC AAAATTAGGG GAACTAGATT TACCAAAGGG TGCACTGCTC 

12 61 TTTATACCAA CAATCTTATT ACATCTTGAC AAGGAAATTT GGGGTGAAGA TGCAGATGAG 

13 21 TTCAATCCGG AGAGATTTAG CGAAGGGGTG GCAAAGGCAA CAAAGGGGAA AATGACATAT 
13 81 TTTCCATTTG GTGCAGGACC GCGAAAATGC ATTGGGCAAA ACTTCGCGAT TTTGGAAGCA 
1441 AAAATGGCTA TAGCTATGAT TCTACAACGC TTCTCCTTCG AGCTCTCTCC ATCTTATACA 
1501 CACTCTCCAT ACACTGTGGT CACTTTGAAA CCCAAATATG GTGCTCCCCT AATAATGCAC 
1561 AGGCTGTAGT CCTGTGAGAA 

SEQ, ID. NO. 216 

1 MGNAHNSKIA AICLIIFLVY KAWELLKWIW IKPKKLESCL RKQGLKGNSY RLFYGDMKEL 
61 SKSLKEINSK PIINLSNEVA PRIIPYYLEI IQKYGKRCFV WQGPTPAILI TEPELIKEIF 
121 GIQ>TVVFQKPN NPNPLTKLLA RGWSYEEEK WAKHRKILNP AFHMEKLKHM LPAFYLSCSE 
181 l^NKWEEIIP VKESNELDIW PHLQRMTSDV ISRAAFGSSY EEGRRIFELQ EEQAEYLTKT 

2 41 FNSVYIPGSR FFPNKMNKRM KECEKEVRET ITCLIDNRLK AKEEGNGKAL NDDLLGILLE 

3 01 SNSIEIEEHG l^KKFGMSIPE VIEECKLFYF AGQETTSVLL VWTLILLGRN PEWQERAREE 
3 61 VFQAFGSDKP TFDELYRLKI VTMILYESLR LYPPIATRTR RTNEETKLGE LDLPKGALLF 
421 IPTILLHLDK EIWGEDADEF NPERFSEGVA KATKGKMTYF PFGAGPRKCI GQNFAILEAK 
481 MAIAMILQRF SFELSPSYTH SPYTWTLKP KYGAPLIMHR L 
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NAME D208-AC8 
ORGATJISM NICOTIANA TABACUM 
SEQ, ID. NO. 217 

1 ATGCTTTCTC 



61 CTATGGACAA 
121 TGGCCGGTAA 
181 CGAAAGCTCG 
241 CCCCTTGTGC 
301 GCCATTTTCT 
361 ATGCTTTTTC 
421 GAAGTTCTCT 
481 ACGAGCATTA 
541 GATTGGTTAG 
601 GAATCCGGTA 
661 GTTTTATCAA 
721 GATTTTCAAG 
781 CAGAACTGGT 
841 AATa\ACAAG 
901 GGTTACTCTC 
961 GACACAGTTG 
1021 TTGATGAAAG 
1081 AGTGATATTA 
1141 CCACCAGGAC 
1201 CACATTCCTA 
1261 CTCTTGTCAA 
1321 CGTGGTCACC 
13 81 ACTTATGCAT 
1441 AAAACTCCAA 
1501 GTAAATCCAG 
1561 AGATGTTTCA 



CCATAGAAGC 
AAAAATCTCA 
TCGGCCATCT 
GAGACTTAGC 
TAGTTGTAAG 
CCAATCGTCC 
TAGCAAATTA 
CTGCTAGTCG 
AGAATTTATA 
AAGAATTGAA 
AAGGAGATGA 
TGGAATTTGT 
GTCATATTAA 
TAGAGGAACA 
ATTTCATTGA 
GTGATACTGT 
CTCTTCACAT 
CACAAGAAa^^ 
AGGATTTAGT 
CTTTGTTAGT 
AAGGGACTAG 
ATCCTGATAA 
ACTATGAGTT 
TGCAAGTGGA 
ATGACGAGGC 
TGGAATTGAT 
TCTTGGTTGA 



CTTTGTAGGA 
AAAACTTCCA 
TTTTCACTTC 
TGATAAATAC 
CAGTTACGAA 
AGCTCTTCTT 
CGGACCTTAC 
TCTCGAAAAA 
CACTCGAATT 
TTTTGGTCTG 
ACAAGTGGAA 
ATTATGGGAT 
GGCAATGAAA 
TATTAATAAA 
TGTGGTGCTT 
CATTAAAGCA 
AAATTGGGGA 
GATAGACACA 
ATACCTCCAA 
ACCACATGAA 
ATTATTCGCA 
GTTCGATCCA 
TATCCCATTT 
ACACCTAACA 
CTTGGATATG 
AATAACGCCT 
TCATTGT 



CTAGTAACCT 
AAACCCTTAC 
AATAACGACG 
GGCCCCGTTT 
GCTATAAAAG 
TACGGCGAAT 
TGGCGAAAAA 
TTCAAACAAG 
AATGGAAATT 
ATCGTGAAAA 
AGATTTAAGA 
GCATTTCCAA 
AGGACATTTA 
AGAGAAAAAA 
TCAAAATTGA 
ACAGTTTTTA 
ATGACATTAT 
AAAGTTGGTA 
GCTATTGTTA 
AATGTAAAGG 
AACGTCATGA 
GAGAGATTCA 
GGTTCTGGAA 
ATGGCACATT 
AAGGAAGGTG 
CGCTTGGCAC 



TCACATTTCT 
CACCGAAAAT 
GCGACa^CCG 
TCACTTTTCG 
ATTGCTTCTC 
ACCTTGGCTA 
ATCGTAAATT 
TGAGATTCAC 
CGAGTACGAT 
TGATCGCTGG 
ATGCGTTTAA 
TTCCATTATT 
AGGATATAGA 
TAGAGGTTGG 
GTAAAGAATA 
GTTTGGTCTT 
TGATAAACAA 
AGGATAGATG 
AAAAGGTGTT 
ATTGTGTTGT 
AACTGCAGCG 
TCGCTGGTGA 
GACGATCTTG 
TAATCCAGGG 
CAGGCATAAC 
CTGAGCTTTA 



CTTATACTTC 
CCCCGGAGGA 
TCCATTAGCT 
GCTAGGTCTT 
TACAAATGAT 
CAATAATACA 
AGTCATTCAG 
CAGAATTCAA 
AAATCTAACT 
GAAAAATTAT 
GGATTTTATG 
TAAATGGGTG 
TTCTGTTTTT 
TGCAGAAGGG 
TCTTGATGAA 
GGATGCAGCA 
TCAAAATGCC 
GGTAGAAGAG 
ACGATTATAT 
TAGTGGATAT 
CGATCCTAAA 
TATTGACTTC 
TCCGGGGATG 
TTTCAATTAC 
AATACGTAAG 
CTAAAACCTA 



SEQ. ID, NO. 218 

1 l^ILSPIEAFVG LVTFTFLLYF LWTKKSQKIiP 
61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK 
181 DWLEELNFGL IVKMIAGKNY ESGKGDEQVE 
241 DFQGHIKAMK RTFKDIDSVF QIWLEEHINK 
301 GYSRDTVIKA TVFSLVLDAA DTVALHINWG 
361 SDIKDLVYLQ AIVKKVLRLY PPGPLLVPHE 
421 LLSNPDKFDP ERFIAGDIDF RGHHYEFIPF 
481 KTPNDEALDM KEGAGITIRK WPVELIITP 



KPLPPKIPGG WPVIGHLFHF NNDGDDRPLA 
AIKDCFSTND AIFSNRPALL YGEYLGYNNT 
FKQVRFTRIQ TSIKITLYTRI NGNSSTINLT 
RFKNAFKDFM VLSMEFVLWD AFPIPLFKWV 
REKIEVGAEG NEQDFIDWL SKLSKEYLDE 
MTLLINNQNA LMKAQEEIDT KVGKDRWVEE 
NVKDCWSGY HIPKGTRLFA l^VMKLQRDPK 
GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
RLAPELY 
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NAME D215-AB5 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 219 

1 GGGAGAAGGC 



61 TTGCAATTAT 
121 CAAAAGAATT 
181 TGTTTGGGGA 
241 ATTTCTCTCA 
3 01 ATTATGGTAA 
361 CGGAACTTGT 
421 CACTCACAAA 
481 ATAGAAGGCT 
541 TCCAATTTAC 
601 CAGAGATAGA 
661 CGTTTGGAAG 
721 CACTAATTCT 
7 81 AAAGGAACAA 
841 TTAAGAAAAG 
901 TATTGGCATC 
961 GTATTGATGA 
1021 CATCTTTACT 
1081 CTAGAGAAGA 
1141 AGCTAAAAAT 
12 01 TGATTAATCG 

12 61 TACAGCTCGT 

13 21 CAATGGAGTT 
13 81 TTGTGTTTTT 
1441 TAGAGGCTAA 
1501 CTTATGCTCA 
1561 TGTACAAGTT 
1621 AACCATATTG 



CTTCAATATG 
CTTTGTACTA 
GGAGAAATAC 
TATGAAAGAG 
TGACATGATT 
GAATTGTATT 
AAAGGAGGTG 
GTTGGCAGCA 
TCTCAATCCT 
TGCTAGTGAG 
TGTGTGGCCA 
TAGTTATGAA 
AGAAGTTTCA 
AAGGATGAAG 
GATGAGTATG 
CAATTTAAAA 
GGTGATTGAA 
TGTATGGACT 
GGTTTTGCAA 
AGTAACTATG 
AATGGTAAAC 
GTTACCAACA 
CAATCCAGAG 
TCCATTTAGT 
AATGGCAATG 
TGCTCCTCAT 
GTAGATATGG 
AGGG 



GAGATACCAT 
AGATGGGCAT 
CTCAGACAGC 
ACGAAGAAAA 
TGGCCTAGAG 
GTGTGGTTTG 
CTAACGAAGA 
ACTGGAATTG 
GCTTTTCACC 
ATGTTGAGCA 
TATTTACAAA 
GAAGGAAGAA 
CGCACAATAT 
CAAATATTTA 
ATTGAAAATG 
GAAATCCAAC 
GAGTGTAAAC 
ATGATTTTGT 
GTGTTTGGGA 
ATCTTAAACG 
AAAGAAACAA 
ATGTTGTTGC 
AGATTTAGTG 
TGGGGTCCAA 
GCTATGATTC 
CCACTACTAC 
TCAATCTGGA 



ATTACAGCTT 
GGAAAATCTT 
AGGGTTTCAA 
TGGGTGAAGA 
TTATGCCATT 
GGCCAAGACC 
ATTTCGTCTA 
CAGGCTATGA 
TTGACAAGTT 
AATTGGAGAA 
CTTTGACAAG 
AGATTTTTGA 
ATATTCCAGG 
ATGAAGTACG 
GAGAAGCACC 
AACATGGAAA 
TCTTCTATTT 
TGTGCAAATA 
GTAGGGAAGT 
AGGTCTTAAG 
AGTTAGGGAA 
AACATGATAC 
ATGGAATATC 
GAATATGTAT 
TGAAAACCTA 
TTCAACCTCA 
ACTTGTTATG 



AAAAATTGCA 
GAATTATGTG 
AGGAAACTCT 
AGCTATGTCT 
CATCCACAAA 
AGCAGTCCTG 
TCAGAAGCCG 
AACAGATAAA 
GAAGCATATG 
AGTTGTTTCA 
TGATGCCATT 
CCTTCAAAAA 
ATGGAGGTTT 
AGCACTGGTA 
TGATGATTTA 
CAACAAGAAA 
TGCTGGGCAA 
TCCTAATTGG 
TGACTATGAC 
GTTGTATCCA 
TTTGTGTTTA 
TGAAATATGG 
CAAAGCAACA 
TGGGC AAAAT 
TGCATTTGAA 
ATATGGTGCT 
GAACTTTTAT 



ATTTCTTCAT 
TGGTTAAAAC 
TACAAATrCT 
AAGCCAATCA 
ACCATCACCA 
ATCACAGACC 
CTTGGCAATC 
TGGGCTACAC 
CTACCTGCAT 
CCAAACGGAA 
TCAAGAACTG 
GAACAACTTT 
TTGCCAACGA 
TTTGGAATTA 
TTGGGAATAT 
TTTGGTATGA 
GAGACTACTT 
CAAGATAAAG 
AAGTTGAATC 
GCAGGATATG 
CCAGCCGGCG 
GGAGATGATG 
AAAGGAAAAC 
TTTGCTATGT 
CTCTCTCCAT 
CAATTAATTT 
CATCGTAATC 



SEQ. ID. NO. 220 

1 MEIPYYSLKI AISSFAIIFV LRWAWKILNY VWLKPKELEK YLRQQGFKGN SYKFLFGDMK 
61 ETKKMGEEAM SKPINFSHDM IWPRVl^PFIH KTITOTGKNC IVWFGPRPAV LITDPELVKE 
121 VLTKNFVYQK PLGNPLTKLA ATGIAGYETD KWATHRRLLN PAFHLDKLKH I'lLPAFQFTAS 
181 EMLSKLEKW SPNGTEIDVW PYLQTLTSDA ISRTAFGSSY EEGRKIFDLQ KEQLSLILEV 

2 41 SRTIYIPGWR FLPTKRNKRM KQIFNEVRAL VFGIIKKRMS MIENGEAPDD LLGILLASNL 

3 01 KEIQQHGNNK KFGMSIDEVI EECKLFYFAG QETTSSLLVW TMILLCKYPN WQDKAREEVL 
3 61 QVFGSREVDY DKLNQLKIVT MILNEVLRLY PAGYVINRMV NKETKLGNLC LPAGVQLVLP 
421 TMLLQHDTEI WGDDAMEFNP ERFSDGISKA TKGKLVFFPF SWGPRICIGQ NFAMLEAKMA 
481 MAMILKTYAF ELSPSYAHAP HPLLLQPQYG AQLILYKL 
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NAME D103-AH3 
ORGANISM NICOTIANA 
SEQ. ID. NO, 221 

1 ATGGTTTTTC 
61 CTATGGACAA 
121 TGGOCGGTAA 
181 CGAAAACTCG 
241 CCCCTTGTGC 
301 GCCATTTTCT 
361 ATGCTTTTTC 
421 GAAGTTCTCT 
481 ACGAGCATTA 
541 GATTGGTTAG 
601 GAATCCGGTA 
661 GTTTTATCAA 
721 GATTTTCAAG 
781 CAGAACTGGT 
841 AATGAACAAG 
901 GGTTACTCTC 
961 GACACAGTTG 
1021 TTGATGAAAG 
1081 AGTGATATTA 
1141 CCACCAGGAC 
1201 CACATTCCTA 
1261 CTCTTGTCAA 
1321 CGTGGTCACC 
1381 ACTTATGCAT 
1441 AAAACTCCAA 
1501 GTAAATCCAG 
1561 AGATCTTTCA 

SEQ. ID. NO. 222 

1 MVFPIEAFVG 
61 RKLGDLADKY 
121 MLFLANYGPY 
181 DWLEELNFGL 
241 DFQGHIKTMK 
301 GYSRDTVIKA 
361 SDIKDLVYLQ 
421 LLSNPDKFDP 
481 KTPNDEVLDM 



TABACUM 



CCATAaAAGC 
AAAAATCTCA 
TCGGCCACCT 
GAGACTTAGC 
TAGTTGTAAG 
CCAATCGTCC 
TAGCAAATTA 
CTGCTAGTCG 
AGAATTTATA 
AAGAATTGAA 
AAGGAGATGA 
TGGAATTTGT 
GTCATATTAA 
TAGAGGAACA 
ATTTCATTGA 
GTGATACTGT 
CTCTTCACAT 
CACAAGAAGA 
AGGATTTAGT 
CTTTGTTAGT 
AAGGGACTAG 
ATCCTGATAA 
ACTATGAGTT 
TGCAAGTGGA 
ATGACGAGGT 
TGGAATTGAT 
TCTTGGTTGA 



CTTTGTAGGA 
AAAACTTCCA 
TTTTCACTTC 
TGATAAATAC 
CAGTTACGAA 
AGCTTTTCTT 
CGGACCTTAC 
TCTCGAAAAA 
CACTCGAATT 
TTTTGGTCTG 
ACAAGTGGAA 
ATTATGGGAT 
GACAATGAAA 
TATTAATAAA 
TGTGGTGCTT 
CATTAAAGCA 
AAATTGGGGA 
GATAGACACA 
ATACCTCCAA 
ACCACATGAA 
ATTATTCGCA 
GTTCGATCCA 
TATCCCATCT 
ACACCTAACA 
CTTGGATATG 
AATAACGCCT 
TCATTGTTTA 



CTAGTAACCT 
AAACCCTTAC 
AATAACGACG 
GGCCCCGTTT 
GCTACAAAAG 
TACGGCGAAT 
TGGCGAAAAA 
TTCAAACAAG 
AATGGAAATT 
ATCGTGAAAA 
AGATTTAAGA 
GCATTTCCAA 
AGGACATTTA 
AGAGAAAAAA 
TCAAAATTGA 
ACAGTTTTTA 
ATGACATTAT 
AAAGTTGGTA 
GCTATTGTTA 
AATGTAAAGG 
AACGTCATGA 
GAGAGATTCA 
GGTTCTGGAA 
ATGGCACATT 
AAGGAAGGTG 
CGCTTGGCAC 
ATA 



TCACATTTCT 
CACCGAAAAT 
GCGACGACCG 
TCACTTTTCG 
ATTGCTTCTC 
ACCTTGGCTA 
ATCGTAAATT 
TGAGATTCAC 
CGAGTACGAT 
TGATCGCTGG 
ATGCGTTTAA 
TTCCATTATT 
AGGATATAGA 
TGGAGGTTGG 
GTAAAGAATA 
GTTTGGTCTT 
TGATAAACAA 
AGGATAGATG 
AAAAGGTGTT 
ATTGTGTTGT 
AACTGCAGCG 
TCGCTGGTGA 
GACGATCTTG 
TAATCCAGGG 
CAGGCATAAC 
CTGAGCTTTA 



CTTATACTTC 
CCCCGGAGGA 
TCCATTAGCT 
GCTAGGTCTT 
TACAAATGAC 
CAATAATACA 
AGTCATTCAG 
CAGAATTCAA 
AAATCTAACT 
GAAAAATTAT 
GGATTTTATG 
TAAATGGGTG 
TTCTGTTTTT 
TGCAGAAGGG 
TCTTGATGAA 
GGATGCAGCA 
TCAAAATGCC 
GGTAGAAGAG 
ACGATTATAT 
TAGTGGATAT 
CGATCCTAAA 
TATTGACTTC 
TCCGGGGATG 
TTTCAATTAC 
AATACGTAAG 
CTAAAACCTA 



LVTFTFLLYF 
GPVFTFRLGL 
WRKNRKIiVIQ 
IVKMIAGKNY 
RTFKDIDSVF 
TVFSLVLDAA 
AIVKKVLRLY 
ERFIAGDIDF 
KEGAGITIRK 



LVJTKKSQKLP 
PLVLWSSYE 
E\a:.SASRLEK 
ESGKGDEQVE 
QI>1WLEEHINK 
DTVALHINWG 
PPGPLLVPHE 
RGHHYEFIPS 
VNPVELIITP 



KPLPPKIPGG 
ATKDCFSTND 
FKQVRFTRIQ 
RFKNAFKDFM 
REKMEVGAEG 
MTLLINNQNA 
NVKDCWSGY 
GSGRRSCPGM 
RLAPELY 



WPVIGHLFHF 
AIFSNRPAFL 
TSIKNLYTRI 
VLSMEFVLWD 
NEQDFIDWL 
LMKAQEEIDT 
HIPKGTRLFA 
TYALQVEHLT 



NNDGDDRPLA 
YGEYLGYNNT 
NGNSSTINLT 
AFPIPLFKWV 
SKLSKEYLDE 
KVGKDRWVEE 
IWMKLQRDPK 
MAHLIQGFNY 
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NAME D208-AD9 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 223 

1 ATGCTTTCTC CCATAGAAGC CATTGTAGGA CTAGTAACCT TCACATTTCT CTTCTTCTTC 
61 CTATGGACAA AAAAATCTCA AAAACCTTCA AAACCCTTAC CACCGAAAAT CCCCGGAGGA 
121 TGGCCGGTAA TCGGCCATCT TTTCCACTTC AATGACGACG GCGACGACCG TCCATTAGCT 
181 CGAAAACTCG GAGACTTAGC TGACAAATAC GGCCCCGTTT TCACTTTTCG GCTAGGCCTT 
241 CCCCTTGTCT TAGTTGTAAG CAGTTACGAA GCTGTAAAAG ACTGTTTCTC CACAAATGAC 
301 GCCATTTTTT CCAATCGTCC AGCTTTTCTT TACGGCGATT ACCTTGGCTA CAATAATGCC 
361 ATGCTATTTT TGGCCAATTA CGGACCTTAC TGGCGAAAAA ATCGAAAATT AGTTATTCAG 
421 GAAGTTCTCT CCGCTAGTCG TCTCGAAAAA TTCAAACACG TGAGATTTGC AAGAATTCAA 
481 GCGAGCATGA AGAATTTATA TACTCGAATT GATGGAAATT CGAGTACGAT AAATTTAACT 
541 GATTGGTTAG AAGAATTGAA TTTTGGTCTG ATCGTGAAGA TGATCGCTGG AAAAAATTAT 
601 GAATCCGGTA AAGGAGATGA ACAAGTGGAG AGATTTAAGA AAGCGTTTAA GG ATTTTATG 
661 ATTTTATCAA TGGAGTTTGT GTTATGGGAT GC ATTTCCAA TTCCATTATT TAAATGGGTG 
721 GATTTTCAAG GGCATGTTAA GGCTATGAAA AGGACTTTTA AAGATATAGA TTCTGTTTTT 
781 CAGAATTGGT TAGAGGAACA TATTAATAAA AGAGAAAAAA TGGAGGTTAA TGCAGAAGGG 
841 AATGAACAAG ATTTCATTGA TGTGGTGCTT TCAAAAATGA GTAATGAATA TCTTGGTGAA 
901 GGTTACTCTC GTGATACTGT CATTGAAGCA ACGGTGTTTA GTTTGGTCTT GGATGCAGCA 
961 GACACAGTTG CTCTTCACAT AAATTGGGGA ATGGCATTAT TGATAAACAA TCAAAAGGCC 
1021 TTGACGAAAG CACAAGAAGA GATAGACACA AAAGTTTGTA AGGACAGATG GGTAGAAGAG 
1081 AGTGATATTA AGGATTTGGT ATACCTCCAA GCTATTGTTA AAGAAGTGTT ACGATTATAT 
1141 CCACCAGGAC CTTTGTTAGT ACCACACGAA AATGTAGAAG ATTGTGTTGT TAGTGGATAT 
1201 CACATTCCTA AAGGGACAAG ATTATTCGCA AACGTCATGA AACTGCAACG TGATCCTAAA 
1261 CTCTGGTCTG ATCCTGATAC TTTCGATCCA GAGAGATTCA TTGCTACTGA TATTGACTTT 
1321 CGTGGTCAGT ACTATAAGTA TATCCCGTTT GGTCCTGGAA GACGATCTTG TCCAGGGATG 
1381 ACTTATGCAT TGCAAGTGGA ACACTTAACA ATGGCACATT TGATCCAAGG TTTCAATTAC 
1441 AGAACTCCAA ATGACGAGCC CTTGGATATG AAGGAAGGTG CAGGCATAAC TATACGTAAG 
1501 GTAAATCCTG TGGAACTGAT AATAGCGCCT CGCCTGGCAC CTGAGCTTTA TTAAAACCTA 
1561 AGATGTTTCA TCTTGGTTGA 



SEQ. ID. NO. 224 

1 MLSPIEAIVG LVTFTFLFFP LWTKKSQKPS KPLPPKIPGG WPVXGHLFHF NDDGDDRPIiA 

61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE AVKDCFSTND AIFSNRPAFL YGDYLGYNNA 

121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK FKHVRFARIQ ASMKNLYTRI DGNSSTINLT 

181 DWLEELNFGL IVKMIAGKNY ESGKGDEQVE RFKKAFKDFM ILSMEFVLWD AFPIPLFKWV 

241 DFQGHVKAMK RTFKDIDSVF QNWLEEHINK REKMEVNAEG NEQDFIDWL SKMSNEYLGE 

301 GYSRDTVIEA TVFSLVLDAA DTVALHINWG MALLINNQKA LTKAQEEIDT KVCKDRWVEE 

3 61 SDIKDLVYLQ AIVKEVLFILY PPGPLLVPHE NVEDCWSGY HIPKGTRLFA NVl'lKLQRDPK 

421 LWSDPDTFDP ERFIATDIDF RGQYYKYIPF GPGRRSCPGM TYALQVEHLT MAHLIQGFNY 

481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP RLAPELY 
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NAME D237-AD1 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 225 

1 TTTCATATAC 



61 AGAAGCCATT 
121 ACAATCAAAA 
181 CCATCTCTTT 
241 TGACTTAGCT 
301 GGCGGTGAGT 
361 CGATCGGCCA 
421 TGCCAAATAT 
481 CGTTAGTCGG 
541 ACAATTGTAT 
601 ATTGACATTC 
661 AGAAATACTC 
721 TGATGTTTTT 
781 GAAACAAACT 
841 GAAGAGAGAA 
901 TTCCAAGATG 
961 AACTGTATTC 
1021 AATGGCGTTA 
1081 AATTGTTGGT 
1141 AGCAATTGTC 
12 01 ATCTGTAGAA 

12 61 CAATATTATG 
1321 AGAGAGATTC 

13 81 CCCCTTTGGT 
1441 CCTTTCAATT 
1501 GGATATGAAA 
1561 TACACCTCGC 
1621 TCTGATCAAT 



CTTTAGTACT 
GTAGGATTTG 
ATCTTAAACC 
TATTTCAAGA 
GACAAATATG 
AGTTATGAAG 
TCTTTACTCT 
GGCCCTTACT 
CTCGAAAAAT 
AATTGTGATT 
GACATCATTT 
AAAGTTGCTT 
CACATTCCAT 
TTCAAAGACA 
ACAAAGGATG 
AGCGACGAAC 
ACTTTGGTCT 
ATGATAAACA 
AGAGATAGAT 
AAAGAAGTAT 
GATTGTGTTG 
AAACTACAGC 
CTGACGACTC 
ACGGGGAGAC 
GCTCATATGA 
CAAGGTGTGG 
CTTCCTCCTA 
CCCTCAATG 



CTTGAAATTT 
TAACCTTTTC 
CACTACCTCC 
ACAATGGCGA 
GTCCCGTCTT 
CTATGAAAaA 
ACGGAGAATA 
GGAAAAAAAA 
TCAAACATGT 
CACCAATGGT 
TGAAAATGGT 
TTCAGAAATT 
TTTTCAAGTG 
TTa^TAATAT 
TTGGAGGTGA 
ATCTTGGCGA 
TGGATGCAAC 
ATAAGCATGT 
GGGTAGAAGA 
TACGATTACA 
TCAATGGGTA 
GAGATCCTCA 
ATGCTACTAT 
GAGCTTGTCC 
TCCAAGGTTT 
GTTTAACTTT 
CGCTTTATCA 



TCAAATAATG 

ATTTCTATTC 

AAAAATCCCA 

TGAAGATCGC 

CACTTTCCGG 

ATGCTTCACT 

CCTTTGCTAT 

TCGAAAGTTA 

TAGATTTTCT 

GAAGATAAAC 

TGTTGGGAAG 

CATGGTTCAA 

GTTGGATCTT 

TATCCAAGGT 

AAACGAACAA 

GGGTTACTCT 

AGACACACTT 

CATGAAGAAA 

GAGTGATATC 

TCCACCCGCA 

CCATATTCCT 

AACATGGCCA 

TGACTACCGC 

CGCGATGAAT 

CAGTTTTGCA 

ACCAAAGAAG 

ATATTAAGAT 



GTTTATCTTC 
TACTTTCTAT 
GGTGGATGGC 
CATTTTTCTC 
TTAGGGTTTC 
ACCAATGATA 
AATAACGCJCA 
GTCAATCAAG 
ATAATTCAGA 
CTTAGTGATT 
AACTATAATA 
GCTATGGAGA 
ACAGGGAATA 
TGGTTAGATG 
GATTTTATAG 
CATGACACAA 
GCACTTCATA 
GCACAAGAAG 
AAGAATTTGG 
CCTTTGTCAG 
AAGGGGACTG 
AATCCTGATA 
GGGCAGCACT 
TATTCATTGC 
ACTACGACCA 
ACTGATGTTG 
GTTTTGTTGT 



TTTCTCCCAT 
GGACCAAAAA 
CAGTAATCGG 
AAAAACTCGG 
GCCGTTTCTT 
TCCATTTCGC 
TGCTTGCTGT 
AAGTTCTCTC 
AAAATATTAA 
GGATAGATAA 
ATGGACATGG 
TGGAGCTCTA 
TTAAGGCTAT 
AGCACATTAA 
ATGTGGTGCT 
CCATCAAAGC 
TAAAGTGGGT 
AGATGGACAC 
TGTATCTCCA 
TGCAACACCT 
CACTACTTAC 
AATTCGATCC 
ATGAGTCGAT 
AAGTGGAACA 
ATGAGCCTTT 
AAGTGCTAAT 
CGGGATTCGT 



SEQ. ID. NO. 226 

1 MVYLLSPIEA IVGFVTFSFL FYFLWTKKQS KILNPLPPKI PGGWPVIGHL FYFKNNGDED 

61 RHFSQKLGDL ADKYGPVFTF RLGFRRFLAV SSYEAMKECF TTNDIHFADR PSLLYGEYLC 

121 YNNAMLAVAK YGPYWKKNRK LVNQEVLSVS RLEKFKHVRF SIIQKNIKQL YNCDSPMVKI 

181 NLSDWIDKLT FDIILKMWG KmTlNGHGEI LKVAFQKFMV QAMEMELYDV FHIPFFKWLD 

241 LTGNIKAMKQ TFKDIDNIIQ GWLDEHIKKR ETKDVGGENE QDFIDWLSK MSDEHLGEGY 

301 SHDTTIKATV FTLVLDATDT LALHIKWVMA LMINNKHVMK KAQEEMDTIV GRDRWVEESD 

361 IKNLVYLQAI VKEVLRLHPP APLSVQHLSV EDCWNGYHI PKGTALLTNI MKLQRDPQTW 

421 PNPDKFDPER FLTTHATIDY RGQHYESIPF GTGRRACPAM NYSLQVEHLS lAHMIQGFSF 

481 ATTTNEPLDM KQGVGLTLPK KTDVEVLITP RLPPTLYQY 
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NAME D125-AF11 
ORGANISM NICOTIANA 
SEQ. ID. m, 227 

1 CTTTTTCTCC 
61 AGAGAAGACC 
121 TCGTTCAAAG 
181 GCTTCAAGTT 
241 TGATCTTTTC 
301 AGCTAAAGAA 

3 61 ATTTGATATT 
421 GAGAAAAATG 

4 81 TAGAGGGGGG 
541 ATCTGCTACT 
601 GTTTAGGATT 
661 TAAGGCTTTG 
721 TTTTATTCCC 
7 81 GGAGAAGAGG 
841 TACCAAGAGC 
901 ACAGAAGGGG 
961 TGCTATAGAA 

1021 TCACATCCAA 
1081 GACTGAACCA 
1141 TCTCCGTATG 
1201 CGGGTTTGAT 
1261 CCCGGCTCAT 
1321 GC ACGTTGAG 
13 81 TTGCCCTGGA 
1441 GAACTTTGAG 
1501 ACAGTTCAGT 
1561 AACTTTCTGA 

SEQ, ID, NO. 228 

1 MDLLLLEKTL 
61 FAKKFGDLFL 
121 VYGEHWRKMR 
181 MMYNNMFRIM 
241 ICKEVKEKRL 
301 ENINVAAIET 
3 61 IKETLRLRMA 
421 FFEEEKHVEA 
481 TTEKGGQFSL 



TABACUM 



CCAAAAAAGA 
TTAATTGGTC 
CGTTTTAAGC 
GGTGATGATT 
TTGTTAAGAA 
GTTTTACACA 
TTTACTGGAA 
AGGAGAATTA 
TGGGAGTTTG 
AATGGGATTG 
ATGTTTGATA 
AATGGTGAAA 
ATTTTGAGGC 
CTGCAGCTTT 
TTGGACAGCA 
GAGATCAATG 
ACCACATTAT 
AAGAAACTCC 
GACACCCACA 
GCAATTCCTC 
ATTCCAGCAG 
TGGAAGAAAC 
GCCAATGGCA 
ATTATACTTG 
CTGTTGCCTC 
CTCCATATTT 
TCCTAATCAA 



GCTCATTTCC 
TCTTCTTTGC 
TTCCCCCAGG 
TAAACCACAG 
TGGGCCAGCG 
CACAAGGTGT 
AAGGTCAAGA 
TGACTGTACC 
AAGTGGCAAG 
TATTAAGGAG 
GGAGATTTGA 
GGAGTAGATT 
CTTTTTTGAG 
TCAAAGATTA 
ATGCTCTGAA 
AGGACAACGT 
GGTCAATTGA 
GCGACGAGAT 
AGCTTCC?^TA 
TATTAGTCCC 
AGAGCAAAAT 
CCGAAGAGTT 
ATGACTTCAG 
CATTGCCAAT 
CTCCAGGCCA 
TGAAGCATTC 
TTAAGGGGTT 



CTTGTCCCCA 

CATTTTAATC 

ACCAATCCCA 

AAATCTTACT 

TAATTTAGTT 

TGAATTTGGT 

TATGGTTTTT. 

ATTTTTTACT 

TGTAATTGAG 

GAGATTACAA 

GAGTGAAGAT 

GGCTCAGAGT 

AGGTTATTTG 

CTTTGTTGAT 

ATGTGCGATT 

TCTTTACATT 

GTGGGGTATC 

TGACACAGTT 

CCTTCAGGCT 

ACACATGAAC 

CTTGGTTAAC 

CAGACCCGAG 

ATATCTTCCG 

TCTTGGCATT 

GTCGAAGCTC 

CACCATTGTG 

GAAGAAATTT 



AAAATGGATC 
GCTATAATTG 
GTACCAGTTT 
GATTTTGCCA 
GTTGTGTCAT 
TCAAGAACAA 
ACTGTATATG 
AATAAAGTTG 
GATGTGAAGA 
TTGATGATGT 
GATCCTTTGT 
TTTGAGTATA 
AAGATCTGTA 
GAAAGAAAGA 
GATCACATTC 
GTTGAAAACA 
GCCGAGTTAG 
CTTGGCCCAG 
GTGATCAAGG 
CTTCACGATG 
GCTTGGTGGC 
AGGTTCTTCG 
TTTGGCGTTG 
ACTTTGGGAC 
GACACCACAG 
TTGAAACCAA 
TATAATTATG 



TTCTCTTACT 
TCTCTAGACT 
TTGGTAATTG 
AAAAATTTGG 
CTCCTGAATT 
GAAATGTTGT 
GTGAACACTG 
TGCAGCAATA 
AAAATCCTGA 
ATAATAATAT 
TTGTTAAGCT 
ATTATGGTGA 
AAGAAGTTAA 
AGCTTTCAAA 
TTGAGGCTCA 
TCAATGTTGC 
TCAACCACCC 
GAGTGCAAGT 
AGACGCTTCG 
CAAAGCTTGG 
TAGCTAACAA 
AAGAGGAGAA 
GTAGGAGGAG 
GTTTGGTTCA 
AGAAAGGTGG 
GGTCTTGCTG 



IGLFFAILIA 
LRl^GQRNIiW 
RIMTVPFFTN 
FDRRFESEDD 
QLFKDYFVDE 
TLWSIEWGIA 
IPLLVPHMNL 
NGNDFRYLPF 
HILKHSTIVL 



IIVSRLRSKR 
VSSPELAKEV 
KWQQYRGGW 
PLFVKLKALN 
RKKLSNTKSL 
ELVNHPHIQK 
HDAKLGGFDI 
GVGRRSCPGI 
KPRSC 



FKLPPGPIPV 
LHTQGVEFGS 
EFEVASVIED 
GERSRliAQSF 
DSNALKCAID 
KLRDEIDTVIi 
PAESKILVNA 
ILALPILGIT 



PVFGr^QVG 
RTRNWFDIF 
VKKNPESATN 
EYNYGDFIPI 
HILEAQQKGE 
GPGVQVTEPD 
WWLANNPAHW 
LGRLVQNFEL 



DDLInIHRNLTD 
TGKGQDMVFT 
GIVLRRRLQL 
LRPFLRGYLK 
INEDNVLYIV 
THKLPYLQAV 
KKPEEFRPER 
LPPPGQSKLD 
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NAME D134-AE11 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 229 

1 AACAATAAAA ATGGAGACAT TATTTAACAT CAAAGTTGCA GTTTCATTAG TAATTGTGAT 
61 AATTTTTCTG AGATGGGTAT GGAAATTCTT GAATTGGGTG TGGATTCAAC CAAAGAAAAT 
121 GGAAAAAAGA CTAAAAATGG AAGGTTTCAA AGGAAGCTCA TATAAGCTAT TATTTGGAGA 
181 TATGAAAGAA ATAAATACAA TGGTTGAAGA AGCCAAAACC AAGCCTATGA ATTTTACCAA 
241 TGATTATGTG GCTAGAGTCT TGCCTCACTT CACAAAGTTG ATGCTCCA-\T ATGGCAAGAA 
301 TAGCTTTATG TGGTTAGGGC CAAAACCAAC AATGTTTATC ACAGACCCTG AACTAATAAG 
361 GGAGATCTTG TCAAAAAGTT ACATATACCA GGAGATTCAA GGCAATCCAA TCACTAAGTT 
421 GCTAGCACAA GGACTAGTAA GTTATGAAGC AGAGAAATGG GCTAAGCATA GAAAAATTAT 
481 CAATCCTGCA TTTCACCTTG ACAAGTTGAA GCATATGCTA CCATCATTCT ACTTGAGTTG 
541 TTGTGACATG CTCAGAAAAT GGGAAAGTAT AGCTTCATCA GAGGGATCAG AAATAGACGT 
601 GTGGCCTTTT CTGGAAACGT TGACAAGCGA TGCTATTTCA AGAACAGCTT TTGGTAGTAA 
661 CTATGA-\GAC GGGAGACAGA TATTTGAGCT TCAAAAAGAA CAAGCTGAGT TGATTTTACA 
721 AGCAGCGCGA TGGCTTTACA TCCCCGGATG GAGGTTTGTG CCA.\CAAAGA GGAACAAGAG 
781 GATGAAGCAA ATCGCTAAAG AAGTACGATC ATTAGTGTTG GGAATAATCA ATAAGAGAAT 
841 AAGGGAAATG AAAGCAGGGG AAGCTGCAAA AGATGACTTA CTGGGAATAC TATTGGAATC 
901 TAATTTCAAA GAAATCCAAA TGCACGGAAA CAAGAACTTT GGCATGACTA TCGACGAAGT 
961 GATTGAAGAG TGCAAGTTAT TTTACTTTGC TGGGCAAGAA ACTACTTCAG TTTTGCTTGT 
1021 TTGGACTTTG ATTTTACTGA GTAAGCATGT CGATTGGCAA GAAAGAGCTA GAGAAGAAGT 
1081 TCATCAAGTC TTTGGAAGTA ACAAACCTGA TTATGACGCA TTGAATCAGT TGAAAGTTGT 
1141 AACGATGATA TTCAACGAGG TTTTAAGGTT GTACCCACCG GGAATTACCA TAAGTCGAAC 
1201 TGTACACGAG GATACCAAAT TAGGGAACTT GTCATTGCCA GCAGGGATAC AGCTTGTGTT 
1261 ACCTGCAATT TGGTTGCATC ATGACAATGA AATATGGGGA GATGATGCAA AGGAGTTCAA 
1321 ACCAGAGAGG TTTAGTGAAG GAGTTAATAA AGCAACAAAG GGTAAATTTG CATATTTTCC 
1381 ATTTAGTTGG GGACCAAGAA TATGTGTTGG ACTGAATTTT GCAATGTTAG AGGCAAAAAT 
1441 GGCACTTGCA TTGATTCTAC AACACTATGC TTTTGAGCTC TCTCCATCTT ATGCACATGC 
1501 TCCTCATACA ATTATCACTC TGCAACCTCA ACATGGTGCT CCTTTGATTT TGCGCAAGCT 
1561 GTAGCGCGGA TATATTGATT GGTTATCTAC TGTAG 



SEQ. ID. NO. 230 

1 METLFNIKVA VSLVIVIIFL RWVWKFLNWV 

61 INTMVEEAKT KPMNFTNDYV ARVLPHFTKL 

121 SKSYIYQEIQ GNPITKLLAQ GLVSYEAEKW 

181 LRKWESIASS EGSEIDVWPF LETLTSDAIS 

241 WLYIPGWRFV PTKKNKRMKQ lAKEVRSLVL 

3 01 EIQMHGNKNF GMTIDEVIEE CKLFYFAGQE 

361 FGSNKPDYDA LNQLKWTMI FNEVLRLYPP 

421 WLHHDNEIWG DDAKEFKPER FSEGVNKATK 

481 LILQHYAFEL SPSYAHAPHT IITLQPQHGA 



WIQPKKMEKR LKMEGFKGSS YKLLFGDMKE 
MLQYGKNSFM WLGPKPTMFI TDPELIREIL 
AKHRKIINPA FHLDKLKHML PSFYLSCCDM 
RTAFGSNYED GRQIFELQKE QAELILQAAR 
GIINKRIREM KAGEAAKDDL LGILLESNFK 
TTSVLLVWTL ILLSKHVDWQ ERAREEVHQV 
GITISRTVHE DTKLGNLSLP AGIQLVLPAI 
GKFAYFPFSW GPRICVGLNF AMLEAKMALA 
PLILRKL 
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NAME D209-AH12 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 231 

1 ATATGCAACT GAGATTTGAA GAATACCAAC TAACCAAAAT GCAGTTCTTC AGCTTGGTTT 
61 CCATTTTCCT ATTTCTATCT TTCCTCTTTT TGTTAAGGAT ATGGAAGAAC TCCAATAGCC 
121 AAAGCAAAAA GTTGCCACCA GGTCCATGGA AACTACCAAT ACTAGGAAGT ATGCTTCATA 
181 TGGTTGGTGG ACTACCACAC CATGTCCTTA GAGATTTAGC CAAAAAATAT GGACCACTTA 
241 TGCACCTTCA ATTAGGTGAA GTTTCTGCGG TTGTGGTTAC TTCTCCTGAT ACGGC AAAAG 
3 01 AAGTATTAAA AACTCATGAC ATCGCTTTTG CGTCTAGGCC TAGCCTTTTG GCCCCGGAGA 
361 TTGTCTGTTA CAATAGGTCT GATCTAGCCT TTTGCCCCTA TGGCGACTAT TGGAGACAAA 
421 TGCGTAAAAT ATGTGTCTTG GAAGTGCTCA GTGCCAAGAA TGTTCGGACA TTTAGCTCTA 
481 TTAGGCGGAA TGAAGTTCTT CGTCTCATTA ATTTTATCCG GTCATCTTCT GGTGAACCTA 
541 TTAATGTTAC GGAAAGGATC TTTTTGTTCA CAAGCTCCAT GACATGTAGA TCAGCGTTTG 
601 GGCAAGTGTT CAAAGAGCAA GACAAATTTA TACAACTAAT TAAAGAAGTG ATACTCTTAG 
661 CAGGAGGGTT TGATGTGGCT GACATATTCC CTTCACTGAA GTTTCTTCAT GTGCTCAGTG 
721 GAATGAAGGG TAAGATTATG AATGCACACC ATAAGGTAGA TGCCATTGTT GAGAATGTCA 
7 81 TCAATGAGCA CAAGAAAAAT CTTGCAATTG GGAAAACTAA TGGAGCGTTA GGAGGTGAAG 
841 ATTTAATTGA TGTTCTTCTA AGACTTATGA ATGATGGAGG CCTTCAATTT CCTATCACCA 
901 ACGACAACAT CAAAGCCATA ATTTTTGACA TGTTTGCTGC CGGGACAGAG ACTTCATCGT 
961 CAACAATTGT GTGGGCTATG GTAGAAATGG TGAAAAATCC AGCCGTATTC GCGAAAGCTC 
1021 AAGCAGAAGT AAGAGAAGCA TTTAGAGGAA AAGAAACTTT CGATGAAAAT GATGTGGAGG 
1081 AGCTAAACTA CCTAAAGTTA GTAATAAAAG AAACTCTAAG ACTTCATCCA CCGGTTCCAC 
1141 TTTTGCTCCC AAGAGAATGT AGGGAAGAGA CAAATATAAA CGGCTACACT ATTCCTGTAA 
12 01 AGACCAAAGT CATGGTTAAT GTTTGGGCTT TGGGAAGAGA TCCAAAATAT TGGAATGACG 

12 61 CAGAAACTTT TATGCCAGAG AGATTTGAGC AGTGCTCTAA GGATTTTGTT GGTAATAATT 

13 21 TTGAATATCT TCCATTTGGT GGCGGAAGGA GGATTTGTCC TGGGATTTCG TTTGGCTTAG 
13 81 CTAATGCTTA TTTGCCATTG GCTCAATTAC TATATCACTT CGATTGGAAA CTCCCTGCTG 
1441 GAATCGAACC AAGCGACTTG GACTTGACTG AGTTGGTTGG AGTAACTGCC GCTAGAAAAA 
1501 GTGACCTTTA CTTGGTTGCG ACTCCTTATC AACCTCCTCA AAAGTGATTT AATGGTTTCA 
1561 AGTTTTTATT TCCTAGCAAA CCCCACTATT GTCCTATCTT TCTTTTGGTG TTTTCGGTTT 
1621 TATCTACTCT AATACATGCA TCTTTTACCA TATAGGAATG TACCATGTTG TCG 

SEQ. ID. NO. 232 

1 MQLRFEEYQL TKMQFFSLVS IFLFLSFLFL LRIWKNSNSQ SKKLPPGPWK LPILGSMLHM 

61 VGGLPHHVLR DLAKKYGPLM HLQLGEVSAV WTSPDTAKE VLKTHDIAFA SRPSLLAPEI 

121 VCYNKSDLAF CPYGDYWRQM RKICVLEVLS AKNVRTFSSI RRNEVLRLIN FIRSSSGEPI 

181 NVTERIFLFT SSMTCRSAFG QVFKEQDKFI QLIKEVILLA GGFDVADIFP SLKFLHVLSG 

241 MKGKIMNAHH KVDAIVENVI NEHKKNLAIG KTNGALGGED LIDVLLRLMN DGGLQFPITN 

301 DNIKAIIFDM FAAGTETSSS TIVWAMVEMV KNPAVFAKAQ AEVREAFRGK ETFDENDVEE 

361 L.NYLKLVIKE TLRLHPPVPL LLPRECREET NINGYTIPVK TKVMVNVWAL GRDPKYWNDA 

421 ETFMPERFEQ CSKDFVGNNF EYLPFGGGRR ICPGISFGLA NAYLPLAQLL YHFDWKLPAG 

481 lEPSDLDLTE LVGVTAARKS DLYLVATPYQ PPQK 
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NAME D221-BB8 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 233 

1 OAATTATTTC ACGTGTTGTA TTCCTTGTCT ATGATAGGAA GCTCGTTACC TCAGCGTACA 
61 AACCCCAAAT AAAAAATGAA TTTCCTTGTG GTGTTAGCTT CTCTCTTTCT CTTTGTGTTC 
121 CTAATGAGGA TAAGCAAAGC AAAAAAGCTC CCTCCAGGTC CAAGGAAACT GCCTATAATA 
181 GGAAACCTTC ATCAAATTGG AAAATTACCT CATCGTTCAC TTCAAAAACT TTCTAATGAA 
241 TATGGGGATT TCATTTTCTT GCAATTAGGT TCTGTACCGA CTGTGGTTGT CTCCTCAGCT 
301 GACATTGCCC GAGAGATCTT TAGAACTCAC GACCTTGTTT TCTCAGGCCG TCCTGCTTTA 
361 TATGCTGCCA GAAAACTTTC CTACAATTGC TACAACGTTT CATTTGCACC CTATGGTAAT 
421 TACTGGAGAG AGGCTCGG AA AATTCTAGTG TTGGAGTTGC TAAGTACAAA GAGAGTACAA 
481 AGTTTCGAGG CAATTCGAGA CGAGGAAGTA AGTAGCTTGG TTCAAATTAT CTGTAGTTCC 
541 TTGAGCTCAC CTGTTAACAT AAGCACATTA GCACTATCCT TGGCAAATAA CGTTGTTTGT 
601 CGAGTGGCTT TTGGGAAAGG GAGTGCTGAA GGAGGAAATG ATTATGAGGA TAGGAAGTTT 
661 AATGAAATTC TATATGAGAC ACAAGAATTA TTGGGTGAGT TTAACGTTGC TGATTATTTT 
721 CCTCGGATGG CATGGATTAA CAAAATAAAT GGGTTTGATG AACGATTGGA AAATAATTTT 
7 81 AGGGAATTGG ATAAGTTTTA TGACAAAGTA ATAGAAGATC ATCTTAATTC ATGTAGCTGG 
841 ATGAAACAAA GGGATGATGA AGACGTTATT GATGTATTGC TTCGAATTCA AAAGGATCCA 
901 AGCCAAGAAA TTCCTCTCAA AGATGATCAC ATTAAGGGCC TTCTTGCGGA TATATTCATA 
961 GCTGGAACTG ATACATCATC AACAACCATA GAATGGGCAA TGTCAGAACT CATAAAAAAT 
1021 CCAAGAGTCT TGAGAAAAGC TCAAGAGGAA GTTAGAGAAG TTTCTAAGGG AAAACAAAAG 
1081 GTCCAAGAAA GTGATCTTTG CAAACTAGAT TACTTGAAAT TGGTCATCAA AGAAACCTTT 
1141 AGACTACACC CACCAGTCCC ATTACTAGTC CCTCGAGTAA CAACAGCCAG CTGCAAAATA 

12 01 ATGGAATACG AAATTCCAGT AAATACAAGA GTCTTCATCA ACGCGACAGC AAATGGGACA 
1261 AATCCAAAAT ACTGGGAAAA TCCATTGACA TTCTTGCCAG AGAGATTCTT GGATAAGGAG 
1321 ATTGATTACA GAGGC AAAAA TTTTGAGTTG TTGCCATTTG GGGCAGGGhG AAGAGGGTGT 

13 81 CCAGGAATTA ATTTTTCAAT ACCACTTGTT GAGCTTGCAC TTGCTAATCT ATTGTTTCAT 
1441 TATAATTGGT CACTTCCTGA AGGGATGCTA GCTAAGGATG TTGATATGGA AGAAGCTTTG 
1501 GGGATTACCA TGCACAAGAA ATCTCCCCTT TGCTTAGTAG CTTCTCATTA TACTTGTTGA 
1561 GATTTTAAAA GATTTTAGCA TAGCTATATA TAGCTTGAAG T 

SEQ, ID. NO. 234 

1 MNFLWLASL FLFVFU^IRIS KAKKLPPGPR KLPIIGNLHQ IGKLPHRSLQ KXiSNEYGDFI 

61 FLQLGSVPTV WSSADIARE IFRTHDLVFS GRPALYAARK LSYNCYNVSF APYGNYWREA 

121 RKILVLELLS TKRVQSFEAI RDEEVSSLVQ IICSSLSSPV NISTLALSLA NNWCRVAFG 

181 KGSAEGG^^DY EDRKFNEILY ETQELLGEFN VADYFPRMAW INKINGFDER LENNFRELDK 

241 FTOKVIEDHL NSCSWMKQRD DEDVIDVLLR IQKDPSQEIP LKDDHIKGLL ADIFIAGTDT 

3 01 SSTTIEWAMS ELIKNPRVLR KAQEEVREVS kgkqkvqesd lckldylklv iketfrlhpp 

361 VPLLVPRVTT ASCKIMEYEI PVOTRVFINA TANGTNPKYW ENPLTFLPER FLDKEIDYRG 

421 KNFELLPFGA GRRGCPGINF SIPLVELALA NLLFHYNWSL PEGMLAKDVD MEEALGITMH 

481 KKSPLCLVAS HYTC 
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NAME D222-0H4 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 235 

1 CAAAGACTAA AAGATGTCGG TCTTTGCGGT TATTTCATTC TTTCTACTTC TGTTTTTTCT 
61 TTTCAAATCA TATTTGCCCT CATCGAAAAC AAAGAAAAAT TCTCCACCAT CTCCTTCAAA 
121 GCTTCCGTTA ATCGGTCACT TCCACAAACT AGGCTTACAA CCTCACCGTT CTCTACAAAA 
181 ACTATCAAAT GAACATGGTC CCATGATGAT GCTTCAATTC GGTAGCGTAC CTGTGCTTAT 

2 41 CGCTTCATCA GCTGAAGCTG CTTCCGAAAT CATGAAAACC CAAGATTTGT CTTTTGCAAA 

3 01 CAAACCCATT TCAACCATTC CTAGCAAGCT TTTCTTCGGC CC AAAGGACG TTGCCTTCAC 
3 61 CCCATATGGG GATTACTGGA GGAATGCCAG AAGCATTTGC ATGCTTCAGC TTTTGAACAA 
421 CAAAAGAGTC CAGTCTTTTC GAAAGATAAG GGAAGAAGAG ACTTCTCTTC TTCTCCAGAG 
481 GATTAGGGAA TCGCCAAATT CAGAAGTCGA TTTAACGGAG CTGTTCGTTT CCATGACTAA 
541 CGACATAGTT TGCAGGGTGG CCTTAGGAAG GAAGTATTGT GATGGGGAAG AAGGGAGGAA 
601 ATTCAAGTCT TTGCTGTTAG AGTTTGTGGA ATTGTTGGGA GTTTTTAACA TTGGAGATTA 
661 CATGCCGTGG CTTGCATGGA TGAATCGTTT CAATGGTTTG AATGCCAAAG TGGATAAAGT 
7 21 GGCGAAAGAG TTTGATGCAT TTTTGGAGGA TGTGATTGAG GAACACGGAG GAAATAAGAA 
7 81 ATCAGACACT GAAGCTGAAG GGGCAGACTT CGTGGATATA TTATTGCAGG TTCACAAAGA 
841 AAACAAGGCT GGTTTTCAAG TCGAAATGGA TGCAATCAAA GCTATTATCA TGGATATGTT 
901 TGCTGCGGGA ACAGATACAA CTTCCACGCT TCTAGAGTGG ACAATGAACG AGCTCTTAAG 
961 AAATCCAAAA ACATTGAATA AGTTGAGAGA TGAGGTGAGA CAAGTGACTC AAGGGAAGAC 

1021 AGAGGTAACA GAGGATGACT TAGAGAAAAT GCCGTATTTA AGAGCAGCAG TTAAGGAGAG 
1081 TTCCAGGCTA CACTCTCCAG TGCCACTTCT ACCTCGAGAA GCAATTAAGG ATGCAAAGGT 
1141 TTTGGGCTAC GATATAGCTG CAGGGACTCA AGTCCTCGTT TGTCCATGGG CAATCTCAAG 
1201 AGATCCAAAC CTTTGGGAAA ATCCAGAGGA GTTTCAACCT GAAAGATTCT TGGATACTTC 
1261 CATAGATTAC AAAGGCTTAC ATTTCGAGTT AATTCCATTC GGTGCAGGTC GGAGGGGTTG 
1321 CCCTGGCATC ACATTTGCTA AGTTTGTGAA TGAGCTAGCA TTGGCAAGAT TAATGTTCCA 
13 81 TTTTGATTTC TCGCTACCAA AAGGAGTTAA GCATGAGGAT TTGGACGTGG AGGAAGCTGC 
1441 TGGAATTACT GTTAGAAGGA AGTTCCCCCT TTTAGCCGTC GCCACTCCAT GCTCGTGATT 
1501 TTTATTTTAG AGCTCATTCT ATGCCTTAAA AACTACTACT AGATAACTGC GTAGTAAATA 
1561 ATGCTTGGTA 

SEQ. ID. NO. 236 

1 MSVFAVISFF LLLFFLFKSY LPSSKTKKNS PPSPSKLPLI GHFHKLGLQP HRSLQKLSNE 

61 HGPMMMLQFG SVPVLIASSA EAASEIMKTQ DLSFANKPIS TIPSKLFFGP KDVAFTPYGD 

121 YWRNARSICM LQLLNNKRVQ SFRKIREEET SLLLQRIRES PNSEVDLTEL FVSMTNDIVC 

181 RVALGRKYCD GEEGRKFKSL LLEFVELLGV FNIGDYMPWL AWMNRFNGLN AKVDKVAKEF 

2 41 DAFLEDVIEE HGGNKKSDTE AEGADFVDIL LQVHKENKAG FQVEMDAIKA IIMDMFAAGT 

3 01 DTTSTLLEWT MNELLRNPKT LNKLRDEVRQ VTQGKTEVTE DDLEKMPYLR .\AVKESSRLH 
3 61 SPVPLLPREA IKDAKVLGYD lAAGTQVLVC PWAISRDPNL WENPEEFQPE RFLDTSIDYK 
421 GLHFELIPFG AGRRGCPGIT FAKFVNELAL ARLMFHFDFS LPKGVKHEDL DVEEAAGITV 
481 RRKFPLLAVA TPCS 
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NAME D224-AF10 
ORGANISM NICOTIANA TABACUM 
SEQ, ID. NO. 237 

1 ATTATCCATC ACCTAAAATG GAGAATTCTT GGGTTTTTCT AGCCTTGGCA GGGCTATCTG 
61 CATTAGCTTT TCTCTGTAAA ATAATCACCT GTCGAAGACC GGTTAACCGG AAAATACCAC 
121 CAGGTCCAAA ACCATGGCCC ATCATTGGCA ATTTGAACCT ACTTGGTCCT ATACCACATC 
181 AATCTTTTGA CTTGCTTTCC AAAAAATATG GAGAGTTGAT GCTGCTGAAA TTTGGCTCCA 

2 41 GGCCAGTTCT TGTTGCTTCA TCTGCTGAAA TGGCAAAACA GTTTTTAAAA GTACATGATG 

3 01 CTAATTTCGC CTCCCGTCCT ATGCTAGCTG GTGGAAAGTA TACAAGCTAT AACTATTGTG 
3 61 ACATGACATG GGCACCCTAT GGTCCCTATT GGCGCCAAGC ACGACGAATT TACCTTAACC 
421 AGATATTTAC TCCGAAAAGG CTAGACTCGT TCGAGTACAT TCGTGTTGAA GAAAGGCAGG 
481 CCTTGATTTC CCAGCTGAAT TCCCTTGCTG GAAAGCCATT TTTTCTCAAA GACCATTTGT 
541 CGCGATTTAG CCTCTGCAGC ATGACAAGGA TGGTTTTGAG CAACAAGTAC TTTGGTGAAT 
601 CAACAGTTAG AGTAGAAGAT TTGCAGTACC TGGTAGATCA ATGGTTCTTA CTTAATGGTG 
661 CTTTCAACAT TGGAGATTGG ATTCCATGGC TCAGCTTCTT GGACCTACAA GGCTATGTGA 
7 21 AACAAATGAA GGCTTTGAAA AGAACTTTTG ATAAGTTCCA CAACATTGTG CTAGATGATC 
7 81 GCAGGGCTAA GAAGAATGCA GAGAAGAACT TTGTCCCA.31A AGACATGGTT GATGTCTTGT 
841 TGAAGATGGC TGAAGATCCT AATCTGGAAG TCAAACTCAC TAATGACTGT GTCAAAGGGT 
901 TAATGCAGGA TTTACTAACT GGAGGAACAG ATAGCTTAAC AGCAGCAGTG CAATGGGCAT 
961 TTCAAGAACT TCTTAGACGG CCAAGGGTTA TTGAGAAGGC AACCGAAGAG CTTGACCGGA 

1021 TTGTCGGGAA AGAGAGATGG GTAGAAGAGA AAGATTGCTC GCAGCTATCT TACGTTGAAG 
1081 CAATCCTCAA GGAAACACTA AGGTTACATC CTCTAGGAAC TATGCTAGCA CCGCATTGTG 
1141 CTATAGAAGA TTGTAACGTG GCTGGTTATG ACATACAGAA AGGAACGACC GTTCTGGTGA 
1201 ATGTTTGGAC CATTGGAAGG GACCCAAAAT ACTGGGATAG AGCACAAGAG TTTCTCCCCG 

12 61 AGAGATTCTT AGAGAACGAC ATTGATATGG ACGGACATAA CTTTGCTTTC TTGCCATTTG 

13 21 GCTCGGGGCG AAGGAGGTGC CCTGGCTATA GCCTTGGACT TAAGGTTATC CGAGTAACAT 
13 81 TAGCCAACAT GTTGCATGGA TTCAACTGGA AATTACCTGA AGGTATGAAG CCAGAAGATA 
1441 TAAGTGTGGA AGAACATTAT GGGCTCACTA CACATCCTAA GTTTCCTGTT CCTGTGATCT 
1501 TGGAATCTAG ACTTTCTTCA GATCTCTATT CCCCCATCAC TTAATCCTAA GTGCTTCCTA 
1561 TTATAGCATC ATATCAATAT CCCTC 



SEQ. ID. m, 238 

1 MENSWVFLAIi AGLSALAFLC KIITCRRPVN RKIPPGPKPW PIIGNLNLLG PIPHQSFDLL 
61 SKKYGELMIiL KFGSRPVLVA SSAEMAKQFL KVHDANFASR PMLAGGKYTS YNYCDMTWAP 
121 YGPYWRQARR lYLNQIFTPK RLDSFEYIRV EERQALISQL NSLAGKPFFL KDHLSRFSLC 
181 SMTRMVLSrJK YFGESTVRVE DLQYLVDQWF LIuNGAFNIGD WIPWLSFLDL QGYVKQMKAL 

2 41 KRTFDKFHNI VLDDRRAKKN AEKNFVPKDM VDVLLKMAED PNLEVKLTND CVKGLMQDLL 

3 01 TGGTDSLTAA VQWAFQELLR RPRVIEKATE ELDRIVGKER WVEEKDCSQL S'/VEAILKET 
361 LRLHPLGTl^IL APHCAIEDCN VAGYDIQKGT TVLVNVWTIG RDPKYWDRAQ EFLPERFLEN 
421 DIDMDGHNFA FLPFGSGRRR CPGYSLGLKV XRVTLANMLH GFNWKLPEGM KPEDISVEEH 
481 YGLTTHPKFP VPVILESRLS SDLYSPIT 
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NAME D224'-BD11 
ORGAIJISM NI COT I ANA TABACUM 
SEQ. ID. NO. 239 

1 CTCATTATCC ATCACCTAAA ATGGAGAATT CTTGGGTTTT TCTAGCCTTG GCAGGGCTAT 
61 CTGCATTAGC TTTTCTCTGT AAAATAATCA CCTGTCGAAG ACCGGTTAAC CGGAAAATAC 
121 CACCAGGTCC AAAACCATGG CCCATCATTG GC AATTTGAA CCTACTTGGT CCTATACCAC 
181 ATCAATCTTT TGACTTGCTT TCCAAAAAAT ATGGAGAGTT GATGCTGCTG AAATTTGGCT 
241 CCAGGCCAGT TCTTGTTGCT TCATCTGCTG AAATGGCAAA ACAGTTTTTA AAAGTACATG 
301 ATGCTAATTT CGCCTCCCGT CCTATGCTAG CTGGTGGAAA GTATACAAGC TATAACTATT 
361 GTGACATGAC ATGGGCACCC TATGGTCCCT ATTGGCGCCA AGCACGACGA CGAATTTACC 
421 TTAACCAGAT ATTTACTCCG AAAAGGCTAG ACTCGTTCGA GTACATTCGT GTTGAAGAAA 
481 GGCAGGCCTT GATTTCCCAG CTGAATTCCC TTGCTGGAAA GCCATTTTTT CTCA.AAGACC 
541 ATTTGTCGCG ATTTAGCCTC TGCAGCATGA CAAGGATGGT TTTGAGCAAC AAGTATTTTG 
601 GTGAATCAAC AGTTAGAGTA GAAGATTTGC AGTACCTGGT AGATCAATGG TTCTTACTTA 
661 ATGGTGCTTT CAACATTGGA GATTGGATTC CATGGCTCAG CTTCTTGGAC CTACAAGGCT 
721 ATGTGAAACA AATGAAGGCT TTGAAAAGAA CTTTTGATAA GTTCCACAAC ATTGTGCTAG 
7 81 ATGATCACAG GGCTAAGAAG AATGCAGAGA AGAACTTTGT CCCAAAAGAC ATGGTTGATG 
841 TCTTGTTGAA GATGGCTGAA GATCCTAATC TGGAAGTCAA ACTCACTAAT GACTGTGTCA 
901 AAGGGTTAAT GCAGGATTTA CTAACTGGAG GAACAGATAG CTTAACAGCA GCAGTGCAAT 
961 GGGCATTTCA AGAACTTCTT AGACAGCCAA GGGTTATTGA GAAGGCAACC GAAGAGCTTG 
1021 ACCGGATTGT CGGGAAAGAG AGATGGGTAG AAGAGAAAGA TTGCTCGCAG CTATCTTACG 
1081 TTGAAGCAAT CCTCAAGGAA ACACTAAGGT TACATCCTCT AGGAACTATG CTAGCACCGC 
1141 ATTGTGCTAT AGAAGATTGT AACGTGGCTG GTTATGACAT ACAGAAAGGA ACGACCGTTC 
1201 TGGTGAATGT TTGGACCATT GGAAGGGACC CAAAATACTG GGATAGAGCA CAAGAGTTTC 
1261 TCCCCGAGAG ATTCTTAGAG AACGACATTG ATATGGACGG ACATAACTTT GCTTTCTTGC 
1321 CATTTGGCTC GGGGCGAAGG AGGTGCCCTG GCTATAGCCT TGGACTTAAG GTTATCCGAG 
1381 TAACATTAGC CAACATGTTG CATGGATTCA ACTGGAAATT ACCTGAAGGT ATGAAGCCAG 
1441 AAGATATAAG TGTGGAAGAA CATTATGGGC TCACTACACA TCCTAAGTTT CCTGTTCCTG 
1501 TGATCTTGGA ATCTAGACTT TCTTCAGATC TCTATTCCCC CATCACTTAA TCCTAAGTGC 
1561 TTCCTATTAT AGCATCATAT CAATATCCCT C 

SEQ. ID. NO. 240 

1 MENSWVFLAL AGLSALAFLC KIITCRRPVN RKIPPGPKPW PIIGNLNLLG PIPHQSFDLL 

61 SKKYGELMLL KFGSRPVLVA SSAEMAKQFL KVHDANFASR PMLAGGKYTS YNYCDMTWAJP 

121 YGPYWRQARR RIYLNQIFTP KRLDSFEYIR VEERQALISQ LNSUVGKPFF LKDHLSRFSL 

181 CSMTRMVLSN KYFGESTVRV EDLQYLVDQW FLLNGAFNIG DWIPWLSFLD LQGYVKQMKA 

2 41 LKRTFDKFHN IVLDDHRAKK NAEKNFVPKD MVDVLLKMAE DPNLEVKLTN DCVKGLMQDL 

3 01 LTGGTDSLTA AVQWAFQELL RQPRVIEKAT EELDRIVGKE RWVEEKDCSQ LSWEAILKE 
3 61 TLRLHPLGTM LAPHCAIEDC NVAGYDIQKG TTVLVNVWTI GRDPKYWDRA QEFLPERFLE 
421 imiDMDGHNF AFLPFGSGRR RCPGYSLGLK VIRVTLANML HGFNWKLPEG MKPEDISVEE 
481 HYGLTTHPKF PVPVILESRL SSDLYSPIT 
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NAME D228-AD7 
ORGANISM NICCTIANA TABACUM 
SEQ, ID, NO. 241 

1 TGATAATGCT CTTTCTACTC TTTGTAGCCC TTCCTTTCAT TCTTATTTTT CTTCTTCCTA 
61 AATTCAAAAA TGGTGGAAAT AACAGATTGC CACCAGGTCC TATAGGTTTA CCATTCATTG 
121 GAAATTTGCA TCAATACGAT AGTATAACTC CTCATATCTA TTTTTGGAAA CTTTCAAAAA 
181 AATATGGCAA AATCTTCTCA TTAAAACTTG CTTCTACTAA TGTGGTAGTA GTTTCTTCAG 
241 CAAAATTAGC AAAAGAAGTA TTGAAAAAAC AAGATTTAAT ATTTTGTAGT AGACCATCTA 
3 01 TTCTTGGCCA ACAAAAACTG TCTTATTATG GTCGTGATAT TGCTTTTAAT GATTATTGGA 

3 61 GAGAAATGAG AAAAATTTGT GTTCTTCATC TTTTTAGTTT AAAAAAAGTT CAATTATTTA 
421 GTCCAATTCG TGAAGATGAA GTTTTTAGAA TGATTAAGAA AATATCAAAA CAAGCTTCTA 

4 81 CTTCACAAAT TATTAATTTG AGTAATTTAA TGATTTCATT AACAAGTACA ATTATTTGTA 
541 GAGTTGCTTT TGGTGTTAGG ATTGAAGAAG AAGCACATGC AAGGAAGAGA TTTGATTTTC 
601 TTTTGGCCGA GGCAC AAGAA ATGATGGCTA GTTTCTTTGT ATCTGATTTT TTTCCCTTTT 
661 TAAGTTGGAT TGATAAATTA AGTGGATTGA CATATAGACT TGAGAGGAAT TTCAAGGATT 
721 TGGATAATTT TTATGAAGAA CTCATTGAGC AACATCAAAA TCCTAATAAG CCAAAATATA 
7 81 TGGAAGGAGA TATTGTTGAT CTTTTGCTAC AATTGAAGAA AGAGAAATTA ACACCACTTG 
841 ATCTCACTAT GGAAGATATA AAAGGAATTC TCATGAATGT GTTAGTTGCA GGATCAGACA 
901 CTAGTGCAGC TGCTACTGTT TGGGCAATGA CAGCCTTGAT AAAGAATCCT AAAGCCATGG 
961 AAAAAGTTCA ATTAGAAATC AGAAAATCAG TTGGG AAGAA AGGCATTGTA AATGAAGAAG 

1021 ATGTCCAAAA CATCCCTTAT TTTAA-AGCAG TGATAAAGGA AATATTTAGA TTGTATCCAC 
1081 CAGCTCCACT TTTAGTTCCA AGAGAATCAA TGGAAAAAAC CATATTAGAA GGTTATGAAA 
1141 TTCGGCCAAG AACCATAGTT CATGTTAACG CTTGGGCTAT AGCAAGGGAT CCTGAAATAT 
1201 GGGAAAATCC AGATGAATTT ATACCTGAGA GATTTTTGAA TAGCAGTATC GATTACAAGG 
1261 GTCAAGATTT TGAGTTACTT CCATTTGGTG CAGGCAGAAG AGGTTGCCCA GGTATTGCAC 
1321 TTGGGGTTGC ATCCATGGAA CTTGCTTTGT CAAATCTTCT TTATGCATTT GATTGGGAGT 
13 81 TGCCTTATGG AGTAAAAAAA GAAGACATCG ACACAAACGT TAGGCCTGGA ATTGCCATGC 
1441 ACAAGAAAAA CGAACTTTGC CTTGTCCCAA AAAATTATTT ATAAATTATA TTGGGACGTG 
1501 GATCTCATGC TAGTTCTGTG CGGTCAGCTA AGCTTATTAT TTTTGGCTCA AATTATGTAT 
1561 ACATAATTAG TACATGTTTA AAATGTATAA ATATAGTAGA ACCATTCTCA TGGTT 

SEQ. ID. NO. 242 

1 ^^LFLL>FVALP FILIFLLPKF KNGGNNRLPP GPIGLPFIGN LHQ\T3SITPH IVFWKLSKKY 

61 GKIFSLKLAS TNWWSSAK LAKEVLKKQD LIFCSRPSIL GQQKLSYYGR DIAFNDYWRE 

121 MRKICVLHLF SLKKVQLFSP IREDEVFRMI KKISKQASTS QIINI>SNLMI SLTSTIICRV 

181 AFGVRIEEEA HARKRFDFLL AEAQEMl^tASF FVSDFFPFLS WIDKLSGLTY RLERNFKDLD 

241 NFYEELIEQH QNPNKPKYME GDIVDLLLQL KKEKLTPLDL TMEDIKGILM NVLVAGSDTS 

301 AAATWJAMTA LIKNPKA14EK VQLEIRKSVG KKGIVNEEDV QNIPYFKAVI KEIFRLYPPA 

361 PLLVPRESME KTILEGYEIR PRTIVHVNAW AIARDPEIWE NPDEFIPERF LNSSIDYKGQ 

421 DFELLPFGAG RRGCPGIALG VASMELALSN LLYAFDWELP YGVKKEDIDT NVRPGIAMHK 

481 KNELCLVPKN YL 
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NAME D22 8-AH8 

ORGANISM NI COT I ANA 
SEQ. ID. NO. 243 

1 TGATAATGCT 
61 AATTCAAAAA 
121 GAAATTTGCA 
181 AATATGGCAA 
241 CAAAATTAGC 
301 TTCTTGGCCA 
3 61 ATTATTGGAG 
421 AATTATTTAG 
481 AAGCTTCTAC 
541 TTATTTGTAG 
601 TTGATTTTCT 
661 TTCCCTTTTT 
721 TCAAGGATTT 
7 81 CAAAATATAT 
841 CACCACTTGA 
901 GATCAGACAC 
961 AAGCCATGGA 
1021 ATGAAGAAGA 
1081 TGTATCCACC 
1141 GTTATGAAAT 
1201 CTGAAATATG 

12 61 ATTACAAGGG 
1321 GTATTGCACT 

13 81 ATTGGGAGTT 
1441 TTGCCATGCA 
1501 TGGGACGTGG 
1561 ATTATGTATA 
1621 GGTT 

SEQ. ID. NO. 244 

1 MLFLLFVALP 
61 GKIFSLKLAS 

121 miEMRKICVL 
181 CRVAFGVRFE 
241 DLDNFYEELI 
3 01 DTSAAATVWA 
3 61 PPAPLLVPRE 
421 KGQDFELLPF 
481 MHKKNELCLV 



TABACUM 

CTTTCTACTC 
TGGTGGAAAT 
TCAATATGAT 

AATCTTCTCA 
AAAAGAAGTA 
ACAAAAACTG 
AGAAATGAGA 
TCCAATTCGT 
TTCACAAATT 
AGTTGCTTTT 
TTTGGCCGAG 
AAGTTGGATT 
GGATAATTTT 
GGAAGGAGAT 
TCTCACTATG 
TAGTGCAGCT 
.\AAAGTTCAA 
TGTCCAAAAC 
AGCTCCACTT 
TCGGCCAAGA 
GGAAAATCCA 
TCAAGATTTT 
TGGGGTTGCA 
GCCTTATGGA 
CAAGAAAAAC 
ATCTCATGCT 
CATAATTAGT 



TTTGTAGCCC 
AACAGATTGC 
AGTATAACTC 

TTAAAACTTG 
TTGAAAAAAC 
TCTTATTATG 
AAAATTTGTG 
GAAGATGAAG 
ATTAATTTGA 
GGTGTTAGGT 
GCACAAGAAA 
GATAAATTAA 
TATGAAGAAC 
ATTGTTGATC 
GAAGATATAA 
GCTACTGTTT 
TTAGAAATCA 
ATCCCTTATT 
TTAGTTCCAA 
ACCATAGTTC 
GATGAATTTA 
GAGTTACTTC 
TCCATGGAAC 
GTGAAAAAAG 
GAACTTTGCC 
AGTTCTGTGC 
ACATGTTTAA 



TTCCTTTCAT 
CACCAGGTCC 
CTCATATCTA 
CTTCTACTAA 
AAGATTTAAT 
GTCGTGATAT 
TTCTTCATCT 
TTTTTAGAAT 
GTAATTTAAT 
TTGAAGAAGA 
TGATGGCTAG 
GTGGATTGAC 
TCATTGAGCA 
TTTTGCTACA 
AAGGAATTCT 
GGGCAATGAC 
GAAAATCAGT 
TTAAAGCAGT 
GAGAATCAAT 
ATGTTAACGC 
TACCTGAGAG 
CATTTGGTGC 
TTGCTTTGTC 
AAGACATCGA 
TTGTCCCAAA 
GGTCAGCTAA 
AATGTATAAA 



TCTTATTTTT 
TATAGGTTTA 
TTTTTGGAAA 

TGTGGTAGTA 
ATTTTGTAGT 
TGCTTTTGCA 
TTTTAGTTTA 
GATTAAGAAA 
GATTTCATTA 
AGCACATGCA 
TTTCTTTGTA 
ATATAGACTT 
ACATCAAAAT 
ATTGAAGAAA 
CATGAATGTG 
AGCCTTGATA 
TGGGAAGAAA 
GATAAAGGAA 
GGAAAAAACC 
TTGGGCTATA 
ATTTTTGAAT 
AGGCAGAAGA 
AAATCTTCTT 
CACAAACGTT 
AAATTATTTA 
GCTTATTATT 
TATAGTAGAA 



CTTCTTCCTA 
CCATTCATTG 
CTTTCCAAAA 
GTTTCTTCAG 
AGACCATCTA 
CCTTATAATG 
AAAAAAGTTC 
ATATCAAAAC 
ACAAGTACAA 
AGGAAGAGAT 
TCTGATTTTT 
GAGAGGAATT 
CCTAATAAGC 
GAGAAATTAA 
TTAGTTGCAG 
AAGAATCCTA 
GGCATTGTAA 
ATATTTAGAT 
ATATTAGAAG 
GCAAGGGATC 
AGCAGTATCG 
GGTTGCCCAG 
TATGCATTTG 
AGGCCTGGAA 
TAAATTATAT 
TTTGGCTCAA 
CCATTCTCAT 



FILIFLLPKF 
TNWWSSAK 
HLFSLKKVQL 
EEAHARKRFD 
EQHQNPNKPK 
MTALIKNPKA 
SMEKTILEGY 
GAGRRGCPGI 
PKNYL 



KNGGNNRLPP 
LAKEVLKKQD 
FSPIREDEVF 
FLLAEAQEMM 
YMEGDIVDLL 
MEKVQLEIRK 
EIRPRTIVHV 
ALGVASMELA 



GPIGLPFIGN 
LIFCSRPSIL 
RMIKKISKQA 
ASFFVSDFFP 
LQLKKEKLTP 
SVGKKGIVNE 
NAWAIARDPE 
LSNLLYAFDW 



LHQYDSITPH 
GQQKLSY^iTGR 
STSQIINLSN 
FLSWIDKLSG 
LDLTIffiDIKG 
EDVQNIPYFK 
IWENPDEFIP 
ELPYGVKKED 



lYFWKLSKKY 
DIAFAPYNDY 
LMISLTSTII 
LTYRLERNFK 
ILMNVLVAGS 
AVIKEIFRLY 
ERFLNSSIDY 
XDTNVRPGIA 
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NAME D235"AB1 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 245 

1 AAAATTCATA ATGGTTTTTC CCATAGAAGC CTTTGTAGGA CTAGTAACCT TCACATTTCT 
61 CTTATACTTC CTATGGACAA AAAAATCTCA AAAACTTCCA AAACCCTTAC TACCGAAAAT 
121 CCCCGGhGGA TGGCCGGTAA TCGGCCATCT TTTTCACTTC AATAACGACG GCGACGACCG 
181 TCCATTAGCT CGAAAACTCG GAGACTTAGC TGATAAATAC GGCCCCGTTT TCACTTTTCG 
241 GCTAGGTCTT CCCCTTGTGC TAGTTGTAAG CAGTTACGAA GCTATAAAAG ATTGCTTCTC 
301 TACAAATGAC GCCATTTTCT CCAATCGTCC AGCTTTTCTT TACGGCGAAT ACCTTGGCTA 
3 61 CAATAATACA ATGGTTTTTC TAGCAAATTA CGGACCTTAC TGGCGAAAAA ATCGTAAATT 
421 AGTCATTCAG GAAGTTCTCT CTGCTAGTCG TCTCGAAAAA TTCAAACAAG TGAGATTCAC 
481 CAGAATTCAA ACGAGCATTA AGAATTTATA CACTCGAATT AATGGAAATT CGAGTACGAT 
541 AAATCTAACT GATTGGTTAG AAGAATTGGA TTTTGGTCTG ATCGTGAAAA TGATCGCTGG 
601 GAAAAATTAT GAATCCGGTA AAGGAGATGA ACAAGTGGAA AGATTTAAGA ATGCGTTTAA 
661 GGATTTTATG GTTTTATCAA TGGAATTTGT ATTATGGGAT GCATTTCCAA TTCCATTATT 
721 TAAATGGGTG GATTTTCAAG GTCATATTAA GGCAATGAAA AGGACATTTA AGGATATAGA 

7 81 TTCTGTTTTT CAGAACTGGT TAGAGGAACA TATTAATAAA AGAGAAAAAA TGGAGGTTGG 

8 41 TGCAGAAGGG AATGAACAAG ATTTCATTGA TGTGGTGCTT TCAAAATTGA GTAAAGAATA 
901 TCTTGATGAA GGTTACTCTC GTGATACTGT CATTAAAGCA ACAGTTTTTA GTTTGGTCTT 
961 GGATGCAGCA GACACAGTTG CTCTTCACAT AAATTGGGGA ATGACATTAT TGATAAACAA 

1021 TCAAAATGCC TTGATGAAAG CACAAGAAGA GATAGACACA AAAGTTGGTA AGTATAGATG 
1081 GGTAGAAGAG AGTGATATTA AGGATTTAGT ATACCTCCAA GCTATTGTTA AAAAGGTGTT 
1141 ACGATTATAT CCACCAGGAC CTTTGTTAGT ACCACATGAA TATGTAAAGG ATTGTGTTGT 
1201 TAGTGGATAT CACATTCCTA AAGGGACTAG ATTATTCGCA AACGTCATGA AACTGCAGCG 
12 61 CGATCCTAAA CTCTTGTCAA ATCCTGATAA GTTCGATCCA GAGAGATTCA TCGCTGGTGA 
1321 TATCGACTTC CGTGGTCACC ACTATGAGTT TATCCCATTT GGTTCTGGAA GACGATCTTG 
1381 TCCGGGGATG ACTTATGCAT TGCAAGTGGA ACACCTAACA ATGGCACATT TAATCCAGGG 
1441 TTTCAATTAC AAAACTCCAA ATGACGAGGC CTTGGATATG AAGGAAGGTG CAGGCATAAC 
1501 AATACGTAAG GTAAATCCGG TGGAATTGAT AATAACGCCT CGCTTGGCAC CTGAGCTTTA 
1561 CTAAAACCTA AGATCTTTCA TCTTGGTTGA TCATTGTTTA ATACTCCTAG ATAGATGGGT 
1621 ATTCATC 



SEQ. ID. NO. 246 

1 MVFPIEAFVG LVTFTFLLYF LOTKKSQKLP KPLLPKIPGG WPVIGHLFHF l^NDGDDRPLA 

61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE AIKDCFSTND AIFSNRPAFL YGEYLGYNNT 

121 l^FLANYGPY WRKNRKLVIQ E^/LSASRLEK FKQVRFTRIQ TSIKNLYTRI NGNSSTINLT 

181 DWLEELDFGL IVKMIAGKNY ESGKGDEQVE RFKNAFKDFM VLSMEFVLWD AFPIPLFKWV 

241 DFQGHIKAMK RTFKDIDSVF QNWLEEHItTK REKMEVGAEG l^QDFIDWL SKLSKEYLDE 

301 GYSRDTVIKA TVFSLVLDAA DTVALHINWG MTLLI1^INQNA LMKAQEEIDT KVGKYRWVEE 

361 SDIKDLVYLQ AIVKKVLRLY PPGPLLVPHE YVKDCWSGY HIPKGTRLFA NVMKLQRDPK 

421 LLSNPDKFDP ERFIAGDIDF RGHHYEFIPF GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 

481 KTPNDEALDM KEGAGITIRK VNPVELIITP RLAPELY 
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NAME D243-AA2 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 247 

1 CAAAAAATCA TTTCTCTCGT CTAAAATGGA TCTTCTCTTA CTAGAGAAGA CCTTAATTGG 
61 TCTTTTCTTT GCCATTTTAA TCGCTTTAAT TGTCTCTAAA CTTCGTTCAA AGCGTTTTAA 
121 GCTTCCTCCA GGACCAATTC CAGTACCAGT TTTTGGTAAT TGGCTTCAAG TTGGTGATGA 
181 TTTAAACCAC AGAAATCTTA CTGATTATGC CAAAAAATTT GGCG ATCTTT TCTTGTTAAG 
241 AATGGGTCAA CGTAACTTAG TTGTTGTGTC ATCTCCTGAA TTAGCTAAAG AAGTTTTACA 
301 CACACAAGGT GTTGAATTTG GTTCAAGAAC AAGAAATGTT GTGTTTGATA TTTTTACTGG 
361 AAAAGGTCAA GATATGGTTT TTACTGTATA TGGTGAACAT TGGAGAAAAA TGAGGAGAAT 
421 TATGACTGTA CCATTTTTTA CTAATAAAGT TGTGCAACAG TAT AaAGGGG GGTGGGAGTT 
481 TGAGGTGGCA AGTGTAATTG AGGATGTGAA AAAAAATCCT GAATCTGCTA CTAATGGGAT 
541 CGTATTAAGG AGGAGATTAC AATTAATGAT GTATAATAAT ATGTTTAGGA TTATGTTTGA 
601 TAGGAGATTT GAGAGTGAAG ATGATCCTTT GTTTGTTAAG CTTAAGGCTT TGAATGGTGA 
661 AAGGAGTAGA TTGGCTCAAA GTTTTGAGTA TAATTATGGT GATTTTATTC CAATTTTGAG 
721 GCCTCTTTTG AGAGGTTATT TGAAGATCTG TAAAGAAGTT AAGGAGAAGA GGCTGCAGCT 
781 TTTCAAAGAT TACTTTGTTG ATGAAAGAAA GAAGCTTTCA AATACCAAGA GCTCGGACAG 
841 CAATGCCCTA AAATGTGCGA TTGATCACAT TCTTGAGGCT CAACAGAAGG GAGAGATCAA 
901 TGAGGACAAC GTTCTTTACA TTGTTGAAAA CATCAATGTT GCTGC AATTG AAACAACATT 
961 ATGGTCAATT GAGTGGGGTA TCGCCGAGCT AGTCAACCAC CCTCACATCC AAAAGAAACT 
1021 GCGCGACGAG ATTGACACAG TTCTTGGACC AGGAGTGCAA GTGACTGAAC CAGACACCCA 
1081 CAAGCTTCCA TACCTTCAGG CTGTGATCAA GGAGGCACTT CGTCTCCGTA TGGCAATTCC 
1141 TCTATTAGTC CCACACATGA ACCTTCACGA CGCAAAGCTT GGCGGGCTTG ATATTCCAGC 
1201 AGAGAGCAAA ATCTTGGTTA ACGCTTGGTG GTTAGCTAAC AACCCGGCTC ATTGGAAGAA 
1261 ACCCGAAGAG TTCAGACCCG AGAGGTTCTT TGAAGAGGAG AAGCATGTTG AGGCCAATGG 
1321 CAATGACTTC AGATATCTTC CGTTTGGCGT TGGTAGGAGG AGCTGCCCTG GAATTATACT 
1381 TGCATTGCCA ATTCTTGGCA TCACTTTGGG ACGTTTGGTT CAGAACTTTG AGCTGTTGCC 
1441 TCCTCCAGGC CAGTCGAAGC TCGACACCAC AGAGAAAGGT GGACAGTTCA GTCTCCACAT 
1501 TTTGAAGCAT TCCACCATTG TGTTGAAACC AAGGTCTTTC TGAACTTTGT GATCTTATTA 
1561 ATTAAGGGGT TCTGAAGAAA TTTGATAGTG TTGG 



SEQ. ID. NO. 248 

1 MDLLLLEKTL IGLFFAILIA LIVSKLRSKR 
61 YAKKFGDLFL LRMGQRNIiW VSS PELAKEV 
121 VYGEHWRKMR RIMTVPFFTN KWQQYRGGW 
181 MMYl^JNMFRIM FDRRFESEDD PLFVKLKALN 
241 ICKEVKEKRL QLFKDYFVDE RKKLSNTKSS 
301 ENirrVAAIET TLWSIEWGIA ELVNHPHIQK 
361 IKEALRLRMA IPLLVPHMNL HDAKLGGLDI 
421 FFEEEKHVEA NGNDFRYLPF GVGRRSCPGI 
481 TTEKGGQFSL HILKHSTIVL KPRSF 



FKIiPPGPIPV PVFGNWLQVG DDLNHRNLTD 

LHTQGVEFGS RTR^IWFDIF TGKGQDMVFT 

EFEVASVIED VKKNPESATN GIVLRRRLQL 

GERSRLAQSF EYNYGDFIPI LRPLLRGYLK 

DSNALKCAID HILEAQQKGE INEDNVLYIV 

KLRDEIDTVL GPGVQVTEPD THKLPYLQAV 

PAESKILVNA WWLANNPAHW KKPEEFRPER 

ILALPILGIT LGRLVQNFEL LPPPGQSKLD 
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NAME D244-AD4 
ORGANISM NICOTIANA 
SEQ. ID. NO. 249 

1 AACATTTTGC 
61 ATTATCATAT 
121 CTATTATCTT 
181 CATGGCCTAT 
241 CCCGAATATT 
301 TGTACCCCTA 
361 ATAAGGACTT 
421 Ca^iGGTTTAC 
481 AACATGTACC 
541 AAACTAGCAT 
601 ATATAAGTAA 
661 AGAGATATAG 
721 GC ATCATGTT 
781 AATACTTTGA 
841 CTATTCTTCA 
901 AAGATGCCAT 
961 TTTCTCAGGC 
1021 CAACCGCTGT 
1081 AACAAGGCCA 
1141 ACATAAAAAA 
1201 CTGTTCCTTT 
1261 TTCCTAAAGG 
1321 GGTCAGAGCC 
1381 CTCGCGGTCA 
1441 TAGGTTTTGC 
1501 TTAGTAAGCC 
1561 AGGTTAATCA 
1621 GAAAGTGCAA 

SEQ. ID, NO. 250 

1 MDYHISFHFQ 
61 FPRILGALAD 
121 YARFTYANFG 
181 VNISKWFEQL 
241 FKYFDFQGHI 
301 GFSQATVIKS 
361 TDIKNLVYLQ 
421 IWSEPEKFMP 
481 DFSKPSNTPI 



TABACUM 



AATATAGTTT 
TTCTTTCCAT 
ATGGAGAAGA 
TATAGGCCAT 
GGGCGCTTTG 
TTTGATTGTC 
CGCTGCCCGA 
TTATGCTAAT 
CTCGAGTACT 
CAAAGAATTA 
ATGGTTTGAA 
CAACATAGAG 
TGTTGTAGGG 
TTTCCAAGGT 
AGGATGGTTG 
AGATGCCATG 
CACTGTGATC 
TCATTTGATA 
AGAAGAGATA 
TTTAGTGTAC 
TCTTTTACCA 
TACTCGTCTA 
CGAAAAGTTT 
AAATTTTGAA 
GACTTTAGTG 
ATCAAACACG 
AGTTGAAGTT 
ATCATCAATC 



TCCTAGTCAG 
TTTCAA.GCTC 
ACACTCACTT 
CTTCGTCAGC 
GCAGATAAAT 
AACAATTGGG 
CCAACTTCTA 
TTTGGTCCTT 
AAACTCGAGA 
TATTCTTTGA 
CAATTGACTT 
GAGGATGAAG 
CAAATTGTTT 
CATATACAAT 
GATGATCATA 
CTTAAGGTAA 
AAGTCGACAG 
TGGGTAATGT 
GACATGAAAG 
CTTCAGGCTA 
CACGAAGCAG 
TATATCAATG 
ATGCCCAATA 
TTTATACCGT 
ACACATCTGA 
CCAATTGACA 
CTAATTACCC 
ATGGCTTGAG 



TTCTAGCCTC 
TTTTAGGGCT 
CAAGAAAATT 
TGAGTGGTAC 
ATGGACCTGT 
A.\GCAGCTAA 
TGGCTGGTGA 
ATTATAACCA 
AAATGAAACA 
CGCTGGGC AA 
TAAACATAAT 
AGGCACAACG 
TATATGACGC 
TGATGAACAA 
TGATGAACAA 
CACAACTTAA 
TCTTGAGTTT 
CCTTATTACT 
TGGGTAAAGA 
TCGTTAAAGA 
TGCAAGATTG 
CGTGGAAAGT 
GATTCTTGAC 
TTGGTTCTGG 
CTTTTGGTCG 
TGACAGAAGG 
CTCGTTTACC 
TAATTAGTTA 



CTTTTCCTTA 
TTTAGCCTTT 
AGCCCCTGAA 
TGATAAGAAT 
CTTCACACTG 
GGATTGTCTC 
AAGCATCGGG 
AGTGCGC AAA 
CATACGTGTT 
AAACAACATG 
CGTGAAGACA 
TTTCAGAAAG 
AATTCCATTC 
AATTTATAAA 
GGATGTAAAC 
TGAATTCAAA 
GATCTTAGAT 
GAACAATCCA 
GAGGTGGATT 
GACATTGCGC 
TAAAGTGACT 
ACATCGCGAT 
TAGCAAAGCA 
GAGACGGTCA 
CTTGCTTCAA 
CGTAGGCGTT 
TTCTAAGCTT 
TACTTTAATA 



GAAATAATGG 
GTGTTCTTGT 
ATCCCAGGGG 
ATCCCATTTC 
AGAATAGGGA 
ACAACGCATG 
TACAAGTATG 
CTAGCCCTAC 
TCTGAATTGG 
CAAAAAGTGA 
ATTTGTGGCA 
GCATTTAAGG 
CCATTGTTCA 
GACTTAGATT 
AATAAGGATC 
GCCTATGGTT 
GGAAATGACA 
CATGTTATGA 
GAAGATACTG 
TTGTATCCAC 
GGTTACCACA 
CCTGAAATTT 
AATATAGATG 
TGTCCAGGGA 
GGTTTTGATT 
ACTTTGCCTA 
TATTTATTTT 
TGTTTCTC 



ALLGLLAFVF 
KYGPVFTLRI 
PYYrJQVRKIiA 
TLNIIVKTIC 
QLMNKIYKDL 
TVLSLILDGN 
AIVKETLRLY 
NRFLTSKANI 
DMTEGVGVTL 



LSIILWRRTL 
GMYP^/LIVNN 
LQHVPSSTKL 
GKRYSNIEED 
DSILQGWLDD 
DTTAVHLIWV 
PPVPFLLPHE 
DARGQNFEFI 
PKVNQVEVLI 



TSRKLAPEIP 
WEAAKDCLTT 
EKl^IKHIRVSE 
EEAQRFRKAF 
HMMNKDVNNK 
MSLLLNNPHV 
AVQDCKVTGY 
PFGSGRRSCP 
TPRLPSKIiYL 



GAWPIIGHLR 
HDKDFAARPT 
LETSIKELYS 
KGIMFWGQI 
DQDAIDAMLK 
MKQGQEEIDM 
HIPKGTRLYI 
GIGFATLVTH 
F 



QLSGTDKNIP 
SMAGESIGYK 
LTLGKNNMQK 
VLYDAIPFPL 
VTQLNEFKAY 
KVGKERWIED 
NAWKVHRDPE 
LTFGRLLQGF 
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NAME D247-AH1 
ORGANISM NICOTIANA 
SBQ. ID. NO. 251 



1 


TGATAATGCT 


61 


AATTCAAAAA 


121 


GAAATTTGCA 


181 


AATATGGCAA 


241 


CAAAATTAGC 


301 


TTCTTGGCCA 


-361 


ATTATTGGAG 


421 


AATTATTTAG 


481 


AAGCTTCTAC 


541 


TTATTTGTAG 


601 


TTGATTTTCT 


661 


TTCCCTTTTT 


721 


TCAAGGATTT 


781 


CAAAATATAT 


841 


CACCACTTGA 


901 


GATCAGACAC 


961 


AAGCCATGGA 


1021 


ATGAAGAAGA 


1081 


TGTATCCACC 


1141 


GTTATGAAAT 


1201 


CTGAAATATG 


1261 


ATTACAAGGG 


1321 


GTATTGCACT 


1381 


ATTGGGAGTT 


1441 


TTGCCATGCA 


1501 


TGGGACGTGG 


SEQ. ID. N0» 2 52 


1 


MIjFLLFVALP 


61 


GKIFSLKLAS 


121 


WREMRKICVL 


181 


CRVAFGVRFE 


241 


DLDNFYEELI 


301 


DTSAAATVWA 


361 


PPAPLLVPRE 


421 


KGQDFELLPF 


481 


MHKKNELCLV 



TABACUM 



CTTTCTACTC 
TGGTGGAAAT 
TCAATATGAT 
AATCTTCTCA 
AAAAGAAGTA 
ACAAAAACTG 
AGAAATGAGA 
TCCAATTCGT 
TTCACAAATT 
AGTTGCTTTT 
TTTGGCCGAG 
AAGTTGGATT 
GGATAATTTT 
GGAAGGAGAT 
TCTCACTATG 
TAGTGCAGCT 
AAAAGTTCAA 
TGTCCAAAAC 
AGCTCCACTT 
TCGGCCAAGA 
GGAAAATCCA 
TCAAGATTTT 
TGGGGTTGCA 
GCCTTATGGA 
CAAGAAAAAC 
ATCTCAATTT 



TTTGTAGCCC 
AACAGATTGC 
AGTATAACTC 
TTAAAACTTG 
TTGAAAAAAC 
TCTTATTATG 
AAAATTTGTG 
GAAGATGAAG 
ATTAATTTGA 
GGTGTTAGGT 
GCACAAGAAA 
GATAAATTAA 
TATGAAGAAC 
ATTGTTGATC 
GAAGATATAA 
GCTACTGTTT 
TTAGAAATCA 
ATCCCTTATT 
TTAGTTCCAA 
ACCATAGTTC 
GATGAATTTA 
GAGTTACTTC 
TCCATGGAAC 
GTGAAAAAAG 
GAACTTTGCC 
AGTTCTGTGA 



TTCCTTTCAT 
CACCAGGTCC 
CTCATATCTA 
CTTCTACTAA 
AAGATTTAAT 
GTCGTGATAT 
TTCTTCATCT 
TTTTTAGAAT 
GTAATTTAAT 
TTGAAGAAGA 
TGATGGCTAG 
GTGGATTGAC 
TCATTGAGCA 
TTTTGCTACA 
AAGGAATTCT 
GGGCAATGAC 
GAAAATCAGT 
TTAAAGCAGT 
GAGAATCAAT 
ATGTTAACGC 
TACCTGAGAG 
CATTTGGTGC 
TTGCTTTGTC 
AAGACATCGA 
TTGTCCCAAA 
GGTCAGC 



TCTTATTTTT 
TATAGGTTTA 
TTTTTGGAAA 
TGTGGTAGTA 
ATTTTGTAGT 
TGCTTTTGCA 
TTTTAGTTTA 
GATTAAGAAA 
GATTTCATTA 
AGCACATGCA 
TTTCTTTGTA 
ATATAGACTT 
ACATCAAAAT 
ATTGAAGAAA 
CATGAATGTG 
AGCCTTGATA 
TGGGA.iVGAAA 
GATAAAGGAA 
GGAAAAAACC 
TTGGGCTATA 
ATTTTTGAAT 
AGGCAGAAGA 
AAATCTTCTT 
CACAAACGTT 
AAATTATTTA 



CTTCTTCCTA 
CCATTCATTG 
CTTTCCAAAA 
GTTTCTTCAG 
AGACCATCTA 
CCTTATAATG 
AAAAAAGTTC 
ATATCAAAAC 
ACAAGTACAA 
AGGAAGAGAT 
TCTGATTTTT 
GAGAGGAATT 
CCTAATAAGC 
GAGAAATTAA 
TTAGTTGCAG 
AAGAATCCTA 
GGCATTGTAA 
ATATTTAGAT 
ATATTAGAAG 
GCAAGGGATC 
AGCAGTACCG 
GGTTGCCCAG 
TATGCATTTG 
AGGCCTGGAA 
TAAATTATAT 



FILIFLLPKF 
TNWWSSAK 
HLFSLKKVQL 
EEAHARKRFD 
EQHQNPNKPK 
MTALIKl^PKA 
SMEKTILEGY 
GAGRRGCPGI 
PKNYL 



KNGGNNRLPP 
LAKEVLKKQD 
FSPIREDEVF 
FLLAEAQEMM 
VMEGDIVDLL 
MEKVQLEXRK 
EIRPRTIVHV 
ALGVASMELA 



GPIGLPFIGN 
LIFCSRPSIL 
RMIKKISKQA 
ASFFVSDFFP 
LQLKKEKLTP 
SVGKKGIVNE 
NAWAIARDPE 
LSNLLYAFDW 



LHQYDSITPH 
GQQKLSYYGR 
STSQIINLSN 
FLSWIDKLSG 
LDLTMEDIKG 
EDVQNIPYFK 
IWENPDEFIP 
ELPYGVKKED 



lYFmJLSKKY 
DIAFAPYNDY 
LMISLTSTII 
LTYRLERNFK 
ILMNVLVAGS 
AVIKEIFRLY 
ERFLNSSTDY 
IDTNVRPGIA 
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NAME D248-AA6 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 253 

1 CCAAAATCAT GGCTCTATCT TTCATATTCA TATCCATAAC CCTAATTTTT CTAGTTCATA 
61 AACTCTACCA CCGTCTTAGA TTCAAACTAC CACCAGGTCC GCGGCCGTTA CCGGTGGTCG 
121 GAAACCTCTA CGACATAAAA CCGGTGAGAT TCCGGTGCTT TGCCGATTGG GCCAAAACTT 
181 ACGGTCCGAT TTTCTCAGTA TACTTTGGGT CACAGTTAAA TGTTGTGGTA ACAACAGCTG 
241 AATTAGCTAA AGAAGTATTG AAAGAAAATG ACCAGAATTT AGCAGATAGA TTTAGGACTA 
301 GACCTGCAAA TAATTTGAGC AGA.AATGGGA TGGATTTGAT TTGGGCTGAT TATGGGCCTC 
361 ATTATGTGAA AGTAAGGAAG CTCTGTAATC TTGAGCTTTT TACTCCTAAA AGACTTGAAG 
421 CTCTTAGACC TATTAGAGAA GATGAAGTTA CTGCTATGGT TGAAAACATT TTCAAGGATT 
481 GTACTAAGCC TGATAACACA GGTAAAAGCT TGTTGATAAG AGAGTACTTA GGATCAGTAG 
541 CATTCAACAA CATTACAAGG TTAACATTTG GGAAAAGGTT CATGAACTCA AAAGGTGAGA 
601 TTGATGAGCA AGGTCAAGAA TTCAAGGGTA TTGTCTCTAA TGGCATCAhA ATTGGCGGAA 
661 AACTTCCCTT GGCAGAGTAT GTTCCATGGC TCCGTTGGTT TTTCACAATG GAAAACGAGG 
721 CACTCGTGAA GCACTCTGCA CGTAGAGACC GGTTAACAAG AATGATCATG GATGAACACA 
781 CACTGGCTCG CAAGAAAACT GGTGATACTA AGCAGCATTT TGTCGATGCA TTGCTTACTC 
841 TTCAGAAGCA GTATGATCTT AGTGATGACA CTGTTATTGG CCTCCTCTGG GATATGATTA 
901 CAGCAGGAAT GGACACAACA ACCATAACAG TGGAATGGGC AATGGCAGAA CTAGTTAAGA 
961 ACCCAAGAGT GCAACTAAAA GCTCAAGAGG AGCTTGACAG GGTAATCGGA ACGGATCGAA 
1021 TCATGTCAGA AACCGATTTC TCTAAACTTC CTTACCTACA ATGTGTAGCC AAAGAGGCTC 
1081 TAAGGTTGCA CCCTCCAACT CCTCTAATGC TTCCTCATAA GGCCAGTGCC AGTGTCAAAA 
1141 TTGGTGGTTA TGACATTCCT AAGGGGTCCA TCGTGCACGT GAACGTTTGG GCTGTCGCTC 
1201 GTGACCCAGC CGTGTGGAAG AACCCGTTGG AGTTCAGACC AGAGCGCTTC CTTGAGGAAG 
1261 ACGTTGACAT GAAGGGTCAC GACTATCGGT TATTGCCCTT TGGTGCAGGA AGGCGTGTTT 
1321 GCCCCGGTGC ACAACTTGCT ATCAACTTGG TCACATCTAT GTTGGGTCAT TTGTTGCATC 
13 81 ATTTl'ACATG GGCTCCGOCC CCGGGGGTTA ACCCGGAGGA TATTGACTTG GAGGAGAGCC 
1441 CTGGAACAGT AACTTACATG AAAAATCCAA TACAAGCTAT TCCAACTCCA AGATTGCCTG 
1501 CACACTTGTA TGGACGTGTG CCAGTGGATA TGTAAAACAT TTTGTTCTTT CCCTTTTTGG 
1561 TTATATGATG AG 



SEQ. ID. NO. 254 

1 MALSFIFISI TLIPLVHKIiY HRtiRFKLPPG 

61 IFSVYFGSQL NVVVTTAEL>A KEVLKENDQN 

121 KVRKLCITLEL FTPKRLEALR PIREDEVTAM 

181 NXTRLTFGKR FMNSKGEIDE QGQEFKGIVS 

241 KHSARRDKLT RMIMDEHTLA RKKTGDTKQH 

3 01 l^TTTITVXW AMAELVKNPR VQLKAQEELD 

361 HPPTPLMLPH KASASVKIGG YDIPKGSIVH 

421 MKGHDYRLLP FGAGRRVCPG AQLAINLVTS 

481 VTYMKNPIQA IPTPRLPAHL YGRVPVDM 



PRPLPWGNIi YDIKPVRFRC FADWAKTYGP 
LADRFRTRPA NNLSRNGl^L IWADYGPHYV 
VENIFKDCTK PDNTGKSLLI REYLGSVAFN 
NGIKIGGKLP LAEYVPWLRW FFTMENEALV 
FVDALLTLQK QYDLSDDTVI GLLWDMITAG 
RVIGTDRIMS ETDFSKLPYL QCVAKEALRL 
VNVWAVARDP A^/WKNPLEFR PERFLEEDVD 
MLGHLLHHFT WAPAPGVNPE DIDLEESPGT 
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NAME D2 49-AE8 

ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 255 

1 AATCACTAAT TTTCATGTAC TCTCATAGGT CAAAAGTTTC AACCAAAATC ATGGCTCTAT 
61 CCTTCATATT CATATCCATA ACCCTAATTT TTCTAGTTCA TAAACTCTAC CACCGTCTTA 
121 GATTCAAACT ACCACCAGGT CCGCGGCCGT TACCGGTGGT CGGAAACCTC TACGACATAG 
181 AACCGGTGAG ATTCCGGTGC TTTGCCGATT GGGCC AAAAC TTACGGTCCG ATTTTCTCAG 
241 TATACTTTGG GTCACAGTTA AATGTTGTGG TAACAACAGC TGAATTAGCT AAAGAAGTAT 
301 TGAAAGAAAA TGACCAGAAT TTAGCAGATA GATTTAGGAC TAGACCTGCA AATAATTTGA 
361 GCAGAAATGG GATGGATTTG ATTTGGGCTG ATTATGGGCC TCATTATGTG AAAGTAAGGA 
421 AGCTCTGTAA TCTTGAGCTT TTTACTCCTA AAAGACTTGA AGCTCTTAGA CCTATTAGAG 
481 AAGATGAAGT TACTGCTATG GTTGAAAACA TTTTCAAGGA TTGTACTAAG CCTGATAACA 
541 CAGGTAAAAG CTTGTTGATA AGAGAGTACT TAGGATCAGT AGCATTCAAC AACATTACAA 
601 GGTTAACATT TGGGAAAAGG TTCATGAACT CAAAAGGTGA GATTGATGAG CAAGGTCAAG 
661 AATTCAAGGG TATTGTCTCT AATGGCATCA AAATTGGCGG AAAACTTCCC TTGGCAGAGT 
721 ATGTTCCATG GCTCCGTTGG TTTTTCACA.^ TGGAAAACGA GGCACTCGTG AAGCACTCTG 
781 CACGTAGAGA CCGGTTAACA AGAATGATCA TGGATGAACA CACACTGGCT CGCAAGAAAA 
841 CTGGTGATAC TAAGCAGCAT TTTGTCGATG CATTGCTTAC TCTTCAGAAG CAGTATGATC 
901 TTAGTa^TGA CACTGTTATT GGCCTCCTCT GGGATATGAT TACAGCAGGA ATGGACACAA 
961 CAACCATAAC AGTGGAATGG GCAATGGCAG AACTAGTTAA GAACCCAAGA GTGCAACTAA 
1021 AAGCTCAAGA GGAGCTTGAC AGGGTAATCG GAACGGATCG AATCATGTCA GAAACCGATT 
1081 TCTCTAAACT TCCTTACCTA CAATGTGTAG CCAAAGAGGC TCTAAGGTTG CACCCTCCAA 
1141 CTCCTCTAAT GCTTCCTCAT AGGGCCAGTG CCAGTGTCAA AATTGGTGGT TATGACATTC 
1201 CTAAGGGGTC CATCGTGCAC GTGAACGTTT GGGCTGTCGC TCGTGACCCA GCCGTGTGGA 
1261 AGAACCCGTT GGAGTTCAGA CCAGAGCGCT TCCTTGAGGA AGACGTTGAC ATGAAGGGTC 
1321 ACGACTATCG GTTATTGCCC TTTGGTGCAG GAAGGCGTGT TTGCCCCGGT GCACAACTTG 
1381 CTATCAACTT GGTCACATCT ATGTTGGGTC ATTTGTTGCA TCATTTTACA TGGGCTCCGG 
1441 CCCCGGGGGT TAACCCGGAG GATATTGACT TGGAGGAGAG CCCTGGAACA GTAACTTACA 
1501 TGAAAAATCC AATACAAGCT ATTCCAACTC CAAGATTGCC TGCACACTTG TATGGACGTG 
1561 TGCCAGTGGA TATGTAAAAC 



SEQ. ID. NO. 256 

1 MYSHRSKVST KIMALSFIFI SITLIFLVHK 
61 RCPADWAKTY GPIFSVYFGS OLNWVTTAE 
121 DLIWADYGPH WKVRKLCNL ELFTPKRLEA 
181 LIREYLGSVA FNNITRLTFG KRFMNSKGEI 
241 RWFFTMENEA hVKHSARRDR LTRMIMDEHT 
301 VIGLLWDMIT AGMDTTTITV EWAMAELVKN 
361 YLQCVAKEAL RLHPPTPLML PHRASASVKI 
421 FRPERFLEED VDMKGHDYRL LPFGAGRRVC 
481 PEDIDLEESP GTVTYMKNPI QAIPTPRLPA 



LYHRLRFKLP PGPRPLPWG NLYDIEPVRP 
LAKEVLKE^JD QNLADRFRTR PANNLSRNGM 
liRPIREDEVT AMVENIFKDC TKPDNTGKSL 
DEQGQEFKGI VSNGIKIGGK LPLAEYVPWL 
iju:u<:ktgdtk QHFVDALLTL QKQYDLSDDT 
PRVQLKAQEE LDRVIGTDRI MSETDFSKLP 
GGYDIPKGSI VHVNVWAVAR DPAVWKNPLE 
PGAQLAINLV TSMLGHLLHH FTWAPAPGVN 
HLVGRVPVDM 
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NAME D250-AC11 

ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 257 

1 ATAATGCTCT TTCTACTCTT TGTAGCCCTT CCTTTCATTC TTATTTTTCT TCTTCCTAAA 

61 TTCA^iAAATG GTGGAAATAA CAGATTGCCA CCAGGTCCTA TAGGTTTACC ATTCATTGGA 

121 AATTTGCATC AATATGATAG TATAACTCCT CATATCTATT TTTGGAAACT TTCCAAAAAA 

181 TATGGCAAAA TCTTCTCATT AAAACTTGCT TCTACTAATG TGGTAGTAGT TTCTTCAGCA 

241 AAATTAGCAA AAGAAGTATT GAAAAAACAA GATTTAATAT TTTGTAGTAG ACCATCTATT 

3 01 CTTGGCCAAC AAAAACTGTC TTATTATGGT CGTGATATTG CTTTTGCACC TTATAATGAT 

361 TATTGGAGAG AAATGAGAAA AATTTGTGTT CTTCATCTTT TTAGTTTAAA AAAAGTTCAA 

421 TTATTTAGTC CAATTCGTGA AGATGAAGTT TTTAGAATGA TTAAGAAAAT ATCAAAACAA 

481 GCTTCTACTT CACAAATTAT TAATTTGAGT AATTTAATGA TTTCATTAAC AAGTACAATT 

541 ATTTGTAGAG TTGCTTTTGG TGTTAGGTTT GAAGAAGAAG CACATGCAAG GAAGAGATTT 

601 GATTTTCTTT TGGCCGAGGC ACAAGAAATG ATGGCTAGTT TCTTTGTATC TGATTTTTTT 

661 CCCTTTTTAf^ GTTAGATTGA CAAATTAAGT GGATTGACAT ATAGACTTGA GAGGAATTTC 

721 AAGGATTTGG ATAATTTTTA TGAAGAACTC ATTGAGCAAC ATCAAAATCC TAATAAGCCA 

781 AAATATATGG AAGGAGATAT TGTTGATCTT TTGCTACAAT TGAAGAAAGA GAAATTAACA 

841 CCACTTGATC TCACTATGGA AGATATAAAA GGAATTCTCA TGAATGTGTT AGTTGCAGGA 

901 TCAGACACTA GTGCAGCTGC TACTGTTTGG GCAATGACAG CCTTGATAAA GAATCCTAAA 

961 GCCATGGAAA AAGTTCAATT AGAAATCAGA AAATCAGTTG GGAAGAAAGG CATTGTAAAT 

1021 GAAGAAGATG TCCAAAACAT CCCTTATTTT AAAGCAGTGA TAAAGGAAAT ATTTAGATTG 

1081 TATCCACCAG CTCCACTTTT AGTTCCAAGA GAATCAATGG AAAAAACCAT ATTAGAAGGT 

1141 TATGAAATTC GGCCAAGAAC CATAGTTCAT GTTAACGCTT GGGCTATAGC AAGGGATCCT 

1201 GAAATATGGG AAAATCCAGA TGAATTTATA CCTGAGAGAT TTTTGAATAG CAGTATCGAT 

1261 TACAAGGGTC AAGATTTTGA GTTACTTCCA TTTGGTGCAG GCAGAAGAGG TTGCCCAGGT 

1321 ATTGCACTTG GGGTTGCATC CATGGAACTT GCTTTGTCAA ATCTTCTTTA TGCATTTGAT 

13 81 TGGGAGTTGC CTTATGGAGT GAAAAAAGAA GACATCGACA CAAACGTTAG GCCTGGAATT 

1441 GCCATGCACA AGAAAAACGA ACTTTGCCTT GTCCCAAAAA AATTATTTAT AAATTATATT 

1501 GGGACGTGGA TCTCATGCTA GTTCTGTGCG GTCAGCTAAG CTTA 



SEQ. ID. NO. 258 

1 MLFLLFVALP FILIFLLPKF KNGGNNRLPP GPIGLPFIGN LHQYDSITPH lYFWKIiSKKY 

61 GKIFSLKLAS TNVWVSSAK LAKEVLKKQD LIFCSRPSIL GQQKLSYYGR DIAFAPYNDY 

121 WREl^KICVL HLFSLKKVQL FSPIREDEVF RMIKKISKQA STSQIINLSN LMISLTSTII 

181 CRVAFGVRFE EEAHARKRFD FLLAEAQEMM ASFFVSDFFP FLS . IDKLSG LTYRLERNFK 

241 DLDNFYEELI EQHQNPNKPK YMEGDIVDLL LQLKKEKLTP LDLTMEDIKG ILMNVXiVAGS 

3 01 DTSAAATVWA MTALIKNPKA MEKVQLEIRK SVGKKGIVNE EDVQNIPYFK AVIKEIFRLY 

3 61 PPAPLLVPRE SMEKTILEGY EIRPRTIVHV NAWAIARDPE IWENPDEFIP ERFLNSSIDY 

421 KGQDFELLPF GAGRRGCPGI ALGVASMELiA LStJLLYAFDW ELPYGVKKED IDTNVRPGIA 

481 MHKKNELCLV PKKLFINYIG TWISC 
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NAME D259-AB9 
ORGANISM NICOTIANA TABACUM 
SEQ. ID, NO. 259 

1 CACATTGAGT CCTCTCCCAA ATCACTGATT CACCACCAAA AGTACCAACA ATTCAATGGA 
61 AGGTACAAAC TTGACTACAT ATGCAGCAGT ATTTCTTGAT ACTCTGTTTC TTTTGTTCCT 
121 TTCCAAACTT CTTCGCCAGA GGAAACTCAA TTTACCTCCA GGCCCAAAAC CATGGCCGAT 
181 CATCGGAAAC TTAAACCTTA TTGGCAATCT TCCTCATCGC TCAATCCACG AACTCTCCCT 
241 CAAGTACGGA CCCGTTATGC AACTCCAATT CGGGTCTTTC CCCGTTGTAG TTGGATCCTC 
301 CGTCGAAATG GCTAAGATTT TCCTCAAATC CATGGATATT AACTTTGTAG GCAGGCCTAA 
3 61 AACGGCTGCC GGAAAATACA CAACGTACAA TTATTCCGAT ATTACATGGT CTCCTTACGG 
421 ACCATATTGG CGCCAGGCAC GTAGGATGTG CCTAACGGAA TTATTCAGCA CGAAACGTCT 
481 CGATTCATAC GAGTATATTC GGGCTGAGGA GTTGCATTCT CTTCTCCATA ATTTGAACAA 
541 AATATCAGGG AAACCAATTG TGTTGAAAGA TTATTTGACG ACGTTGAGTT TAAATGTTAT 
601 TAGCAGGATG GTACTGGGGA AAAGGTATTT GGACGAATCC GAGAACTCGT TCGTGAATCC 

6 61 TGAGGAATTT AAGAAGATGT TGGACGAATT GTTTl'TGCTA AATGGTGTAC TTAATATTGG 

7 21 AGATTCAATT CCATGGATTG ATTTCATGGA TTTGCAAGGT TATGTTAAGA GGATGAAAGT 

7 81 AGTGAGCAAG AAATTCGACA AGTTTTTAGA GCATGTTATT GATGAGCATA ACATTAGGAG 

8 41 AAATGGAGTG GAGAATTATG TTGCTAAGGA TATGGTGGAT GTTTTGTTGC AGCTTGCTGA 
901 TGATCCGAAG TTGGAAGTTA AGCTGGAGAG ACATGa?VGTC AAAGCATTCA CTCAGGATAT 
961 GCTGGCTGGT GGAACCGAGA GTTCAGCAGT GACAGTGGAG TGGGC AATTT CAGAGCTGCT 

1021 AAAGAAGCCG GAGATTTTCA AAAAGGCTAC AGAAGAATTG GATCGAGTAA TTGGGCAGAA 
1081 TAGATGGGTA CAAGAAAAGG ACATTCCAAA TCTTCCTTAC ATAGAGGCAA TAGTCAAAGA 
1141 GACTATGCGA CTGCACCCCG TGGCACCAAT GTTGGTGCCA CGTGAGTGTC GAGAAGATAT 
1201 TAAGGTAGCA GGCTACGACG TTCAGAAAGG AACTAGGGTT CTCGTGAGTG TATGGACTAT 

12 61 TGGAAGAGAC CCTACATTGT GGGhCGAGCC TGAGGTGTTC AAGCCGGAGA GATTCCATGA 

13 21 AAAGTCCATA GATGTTAAAG GACATGATTA TGAGCTTTTG CCATTTGGAG CGGGGAGAAG 
1381 AATGTGCCCG GGTTATAGCT TGGGGCTCAA GGTGATTCAA GCTAGCTTAG CTAATCTTCT 
1441 ACATGGATTT AACTGGTCAT TGCCTGATAA TATGACTCCT GAGGACCTCA ACATGGATGA 
15 01 GATTTTTGGG CTCTCTACAC CTAAAAAATT TCCACTTGCT ACTGTGATTG AGCCAAGACT 
1561 TTCACCAAAA CTTTACTCTG TTTGATTCAG CAGTTCTATG GTTCCGTCAA GAT AG 

SEQ. ID. NO. 260 

1 MEGTNLTTVA AVFLDTLFLL FLSKLLRQRK LNLPPGPKPW PIIGNXiNLIG NLPHRSIHEL 

61 SLKYGPVMQL QFGSFPWVG SSVEMAKIFL KSMDINFVGR PKTAAGKYTT YNYSDITWSP 

121 YGPYVmQARR MCLTELFSTK RLDSYEYIRA EELHSLLHNL NKISGKPIVL K0YLTTLSLN 

181 VISRMVX^GKR YLDESENSFV NPEEFKKMLD ELFLLNGVLN IGDSIPWIDF MBLQGYVKRM 

241 KWSKKFDKF LEHVIDEHNI RRNGVENYVA KDMVDVLLQL ADDPKLEVKL ERHGVKAFTQ 

301 DMLAGGTESS AVTVEWAISE LLKKPEIFKK ATEELDRVIG QNRWVQEKDI Pl^TLPYIEAIV 

3 61 KETMRLHPVA PMLVPRECRE DIKVAGYDVQ KGTRVLVSVW TIGRDPTLWD EPEVFKPERF 

421 HEKSIDVKGH DYELLPFGAG RRMCPGYSLG LKVIQASLAN LLHGFNWSLP DNMTPEDLNM 
481 DEIFGLSTPK KFPLATVIEP RLSPKLYSV 
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NAME D218A-AC2 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 261 

1 CTTCTTCCTT CCTAACTAAA AATGGAGATT CAGTTTTCTA ACTTAGTTGC ATTCTTGCTC 
61 TTTCTCTCCA GCATCTTTCT TGTATTCAAA AAATGGAAAA CCAGAAAACT AAATTTGCCT 
121 CCTGGTCCAT GGAAATTACC TTTTATTGGA AGTTTACACC ATTTGGCTGT GGCAGGTCCA 
181 CTTCCTCACC ATGGCCTAAA AAATTTAGCC AAACGCTATG GTCCTCTTAT GCATTTACAA 
241 CTTGGACAAA TTCCTACACT CGTCATATCA TCACCTCAAA TGGCAAAAGA AGTACTAAAA 
3 01 ACTCACGACC TCGCTTTTGC CACTAGACCA AAGCTTGTCG TGGCCGACAT CATTCACTAC 
3 61 GACAGCACGG ACATAGCACT TTCGCCATAC GGTGAATACT GGAGACAAAT TCGTAAAATT 
421 TGCATATTGG AACTCTTGAG TGCCAAGATG GTCAAGTTTT TTAGCTCGAT TCGCCAAGAT 
481 GAGCTCTCGA AGATGGTTTC ATCTATACGA ACGACGCCCA ATCTTCCAGT CAATCTTACC 
541 GACAAGATTT TTTGGTTTAC GAGTTCGGTA ATTTGTAGAT CAGCTTTAGG GAAGATATGT 
601 GGTGACCAAG ACAAATTGAT CATTTTTATG AGGGAAATAA TATCATTGGC AGGTGGATTT 
661 AGTATTGCTG ATTTTTTCCC TACATGGAAA ATGATTCATG ATATTGATGG TTCAAAATCT 
721 AAACTGGTGA AGGCACATCG TAAGATTGAT GAAATTTTGG AAAATGTGGT AAATGAGCAC 
. 7 81 AAACAGAATC GAGCAGATGG TAAAAAGGGT AATGGTGAAT TTGGTGGAGA AGATCTGATT 
841 GATGTTTTGT TAAGAGTTAG AGAAAGTGGA GAAGTTCAAA TTCCAATCAC AGATaA.CAAT 
901 ATCAAATCAA TATTAATCGA CATGTTCTCT GCCGGATCGG AAACATCATC GACAACTATA 
961 ATTTGGGCAT TAGCTGAAAT GATGAAGAAA CCAAGTGTTT TAGCAAAGGC ACAAGCTGAA 
1021 GTGAGCCAAG CTTTGAAGGG GAAGAAAATT AGTTTTCAAG AGATTGATAT TGATAAGCTA 
1081 AAGTATTTGA AGTTAGTGAT CAAAGAAACT TTAAGAATGC ACCCTCCAAT TCCTCTGTTA 
1141 GTCCCTAGAG AATGTATGGA AGATACAAAG ATTGATGGTT ACAATATACC TTTCAAAACA 
1201 AGAGTCATTG TTAATGCATG GGCAATTGGA CGAGATCCTC AAAGTTGGGA TGATCCTGAA 
12 61 AGCTTTACGC CAGAa^^GATT TGAGAATAAT TCTATTGATT TTCTTGGAAA TCATCATCAA 
1321 TTTATTCCAT TTGGTGCAGG AAGAAGGATT TGTCCTGGAA TGCTATTTGG TTTAGCTAAT 
1381 GTTGGACAAC CTTTAGCTCA GTTACTTTAT CACTTCGATT GGAAACTCCC TAATGGACAA 
1441 ACTCACCAAA ATTTCGACAT GACTGAGTCA CCTGGAATTT CTGCTACAAG AAAGGATGAT 
1501 CTTATTTTGA TTGCCACTCC TGCTCATTCT TGATTAAGTA TTGCTGCTTT TCTATTGGAG 
1561 AATTTTCAAA ATTCATCCAC AATATATAGT GTTTGCTAGA GTTGGTTAGC 



SEQ. ID. NO. 262 

1 MEIQFSNLVA FLLFLSSIFL VFKKWKTRKL NLPPGPWKLP FIGSLHHLAV AGPLPHHGLK 
61 NLAKRYGPLM HLQLGQIPTL VISSPQMAKE VLKTHPLAFA TRPKLWADI IHYDSTDIAL 
121 SPYGEYVJRQI RKICILELLS AKMVKFFSSI RQDELSKMVS SIRTTPNLPV rJLTDKIFWFT 
181 SSVICRSALG KICGDQDKLI IFMREIISLA GGFSIADFFP TWK14IHDIDG SKSKLVKAHR 
241 KIDEILENW rTEKKQlTRADG KKGNGEFGGE DLIDVLLRVR ESGEVQIPIT DDNIKSILID 

3 01 MFSAGSETSS TTIIWALAEM bIKKPSVLAKA QAEVSQALKG KKISFQEIDI DKhKYLKLVl 
361 KETLRMHPPI PLLVPRECME DTKIDGYNIP FKTRVIVNAW AIGRDPQSWD DPESFTPERF 
421 ENNSIDFLGN HHQFIPFGAG RRICPGMLFG LANVGQPLAQ LLYHPDWKLP NGQTHQNFDM 

4 81 TESPGISATR KDDLILIATP AHS 
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NAME D210-BD4 

ORG AIT ISM NICOTIAN A TABACUM 

SEQ. ID» NO. 263 

1 CTTTCATCAT ATGGCATGAA ATGGGAAATG CTCACAACAG CAAAATTGCA GCAATCTGTT 
61 TGATAATTTT CTTGGTATAT AAAGCATGGG AATTGTTGAA GTGGATATGG ATTAAGCCAA 
121 AGAAACTGGA GAGTTGCCTC AGAAAACAGG GACTCAAAGG AAATtCCTAC GGGCTATTCT 
181 ATGGAGATAT GAAAGAATtG TCCAAAAGTC TCAAGGAAAT CAATTCAAAG CCCATCATCA 
241 ATCTATCAAA TGAAGTAGCC CCAAGAATCA TTCCTTATtA TCTTGAAATC ATCCAAAAAT 
301 ATGGTAAAAG ATGTTTTGTT TGGCAAGGAC CAACCCCCGC AATATTAATA ACAGAGCCAG 
3 61 AATTAATAAA GGAGATATTT GGTAAGAACT ATGTTTTTCA GAAGCCTAAT AATCCCAACC 
421 CACTGACCAA GTTATTGGCT CGAGGTGTTG TAAGCTACGA GGAAGAAAAA TGGGCAAAAC 
481 ACAGAAAGAT CTTAAATCCT GCCTTTCATA TGGAGAAGTT GAAGCATATG CTACCAGCAT 
541 TTTACTTGAG CTGTAGTGAG ATGCTGAACA AATGGGAGGA GATTATCCCA GTAAAAGAAT 
601 CAAATGAGTT GGACATTTGG CCTCATCTTC AAAGAATGAC AAGTGATGTG ATTTCTCGTG 
661 CTGCCTTTGG TAGTAGCTAC GAAGAAGGAA GAAGAATATT TGAACTTCAA GAAGAACAAG 
721 CTGAGTATCT AACGAAGACA TTCAATTCAG TTTATATCCC AGGTTCCAGA TTTTTTCCCA 
781 ATAAAATGAA CAAAAGAATG AAAGAATGTG AAAAGGAAGT ACGAGAAACA ATTACGTGTC 
841 TAATTGACAA CAGATTAAAG GCAAAAGAAG AAGGCAATGG CAAGGCCCTC AATGATGACC 
901 TATTGGGTAT ATTATTAGAG TCAAATTCTA TAGAAATTGA AGAACATGGT AACAAGAAGT 
961 TTGGAATGAG TATACCTGAA GTAATTGAAG AGTGCAAATT ATTCTATTTT GCTGGCCAAG 
1021 AGACTACATC AGTATTGCTT GTGTGGACAC TGATTTTGTT AGGGAGAAAt cCAGAATGGC 
1081 AGGAACGTGC TAGAGAGGAA GTTTTTCAAG CCTTTGGAAG TGATAAACCA ACTTTTGACG 
1141 AATTATATCG CTTGAAAATT GTGACGATGA TTTTGTACGA GTCTTTAAGG TTATATCCAC 
1201 CAATAGCAAC TCGTACTCGA AGGACTAATG AAGAAACAAA ATTAGGGGAA CTAGATTTAC 
1261 CAAAGGGTGC ACTGCTCTTT ATACCAACAA TCTTATTACA TCTTGACAGG GAAATTTGGG 
1321 GTGAAGATGC AGATGAGTTC AATCCGGAGA GATTTAGCGA AGGGGTGGCA AAGGCAACAA 
1381 AGGGGAAAAT GACATATTTT CCATTTGGTG C^GGhCCGCG AAAATGCATT GGGC AAAACT 
1441 TCGCGATTTT GGAAGCAAAA ATGGCTATAG CTATGATTCT ACAACGCTTC TCCTTCGAGC 
1501 TCTCTCCATC TTATACACAC TCTCCATACA CTGTGGTCAC TTTGAAACCC AAATATGGTG 
1561 CTCCCCTAAT AATGCACAGG CTGTAGTCCT GTGAGAATAT GCTATCCGAG G 

SEQ. ID. NO. 264 

1 MGNAHNSKIA AICLIIFLW KAWELLKWIW IKPKKLESCL RKQGLKGNSY GLFYGDMKEL 
61 SKSLKEINSK PIINLSNEVA PRIIPYYLEI IQKYGKRCFV WQGPTPAILI TEPELIKEIF 
121 GKNYVFQKPN NPNPLTKLLA RGWSYEEEK WAKHRKILNP AFHHEKLKHM LPAFYLSCSE 
181 MLNKWEEIIP VKESNEbDIW PHLQRMTSDV ISRAAFGSSY EEGRRIFELQ EEQAEYLTKT 
241 FNSVYIPGSR FFPNKMNKRM KECEKEVRET ITCLIDNRLK AKEEGNGKAL NDDLLGILLE 
301 SNSIEIEEHG NKKFGMSIPE VIEECKLFYF AGQETTSVLL VWTLILLGRN PEWQERAREE 
361 VFQAFGSDKP TFDELYRLKI VTMILYESLR LYPPIATRTR RTNEETKLGE LDLPKGALLP 
421 IPTILLHLDR EIWGEDADEF NPERFSEGVA KATKGKMTYF PFGAGPRKCI GQNFAILEAK 
481 MAIAMILQRF SFELSPSYTH SPYTWTLKP KYGAPLIMHR L 
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NAME D233-AG7 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 265 

1 CTCATTATCC ATCACCTAAA ATGGAGAATT CTTGGGTTTT TCTAGCCTTG 
61 CTGCATTAGC TTTTCTCTGT AAAATAATCA CCTGTCGAAG ACCGGTTAAC 
121 CACCAGGTCC AAAACCATGG CCCATCATTG GCAATTTGAA CCTACTTGGT 
181 ATCAATCTTT TGACTTGCTT TCCAAAAAAT ATGGAGAGTT GATGCTGCTG 
241 CCAGGCCAGT TCTTGTTGCT TCATCTGCTG AAATGGCAAA ACAGTTTTTA 
301 ATGCTAATTT CGCCTCCCGT CCTATGCTAG CTGGTGGAAA GTATACAAGC 
361 GTGACATGAC ATGGGCACCC TATGGTCCCT ATTGGCGCCA AGCACGACGA 
421 ACCAGATATT TACTCCGAAA AGGCTAGACT CGTTCGAGTA CATTCGTGTT 
481 AGGCCTTGAT TTCCCAGCTG AATTCCCTTG CTGGAAAGCC ATTTTTTCTC 
541 TffTCGCGATT TAGCCTCTGC AGCATGACAA GG ATGGTTTT GAGCAACAAG 
601 AATCAACAGT TAGAGTAGAA GATTTGCAGT ACCTGGTAGA TCAATGGTTC 
661 GTGCTTTCAA CATTGGAGAT TGGATTCCAT GGCTCAGCTT CTTGGACCTA 
721 TGAAACAAAT GAAGGCTTTG AAAAGAACTT TTGATAAGTT CCACAACATT 
781 ATCACAGGGC TAAGAAGAAT GCAGAGAAGA ACTTTGTCCC AAAAGACATG 
841 TGTTGAAGAT GGCTGAAGAT CCTAATCTGG AAGTCAAACT CACTAATGAC 
901 GGTTAATGCA GGATTTACTA ACTGGAGGAA CAGATAGCTT AACAGCAGCA 
961 CATTTCAAGA ACTTCTTAGA CAGCCAAGGG TTATTGAGAA GGC AACCGAA 
1021 GGATTGTCGG GAAAGAGAGA TGGGTAGAAG AGAAAGATTG CTCGCAGCTA 
1081 AAGCAATCCT CAAGGAAACA CTAAGGTTAC ATCCTCTAGG AACTATGCTA 
1141 GTGCTATAGA AGATTGTAAC GTGGCTGGTT ATa^CATACA GAAAGGAACG 
1201 TGAATGTTTG GACCATTGGA AGGGACCCAA AATACTGGGA TAGAGCACAA 
1261 CCGAG AGATT TTTAGAGAAC GACATTGATA TGGACGGACA TAACTTTGCT 
1321 TTGGCTCGGG GCGAAGGAGG TGCCCTGGCT ATAGCCTTGG ACTTAAGGTT 
1381 CATTAGCCAA CATGTTGCAT GGATTCAACT GGAAATTACC TGAAGGTATG 
1441 ATATAAGTGT GGAAGAACAT TATGGGCTCA CTACACATCC TAAGTTTCCT 
1501 TCTTGGAATC TAGACTTTCT TCAGATCTCT ATTCCCCCAT CACTTAATCC 
1561 CTATTATAGC 



GCAGGGCTAT 
CGG AAAATAC 
CCTATACCAC 
AAATTTGGCT 
AAAGTACATG 
TATAACTATT 
ATTTACCTTA 
GAAGAAAGGC 
AAAGACCATT 
TATTTTGGTG 
TTACTTAATG 
CAAGGCTATG 
GTGCTAGATG 
GTTGATGTCT 
TGTGTCAAAG 
GTGCAATGGG 
GAGCTTGACC 
TCTTACGTTG 
GCACCGCATT 
ACCTTTCTGG 
GAGTTTCTCC 
TTCTTGCCAT 
ATCCGAGTAA 
AAGCCAGAAG 
GTTCCTGTGA 
TAAGTGCTTC 



SEQ. ID. NO. 266 

1 MENSWVFLAL AGLSALAFLC KIITCRRPVN 
61 SKKYGELMLL KFGSRPVLVA SSAEt^AKQFL 
121 YGPYWRQARR lYLNQIFTPK RLDSFEYIRV 
181 SMTRMVLSITK YFGESTVRVE DLQYLVDQWF 
241 KRTFDKFHNI VLDDHRAKKN AEKNFVPKDM 
301 TGGTDSLTAA VQWAFQELLR QPRVIEKATE 
361 LRLHPLGTML APHCAIEDCN VAGYDIQKGT 
421 DIDMDGHNFA FLPFGSGRRR CPGYSLGLKV 
481 YGLTTHPKFP VPVILESRLS SDLYSPIT 



RKIPPGPKPW PIIGNLNLLG PIPHQSFDLL 
KVHDANFASR PMLAGGKYTS YNYCDMTWAP 
EERQALISQL NSLAGKPFFL KDHLSRFSLC 
LLNGAFNIGD WIPWLSFLDL QGYVKQMKAL 
VDVLLKMAED PNLEVKLTND CVKGLMQDLL 
ELDRIVGKER WVEEKDCSQL SYVEAILKET 
TFLVNVWTIG RDPKYWDRAQ EFLPERFLEN 
IRVTLANMLH GFNWKLPEGM KPEDISVEEH 
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NAME D257-AE4 
ORGANISM NICOTIANA 
SEQ. ID. NO. 267 

1 CACATTGAGT 
61 AGGTACAAAC 
121 TTCCAAACTT 
181 CATCGGAAAC 
241 CAAGTACGGA 
3 01 CGTCGAAATG 
3 61 AACGGCTGCC 
421 ACCATATTGG 
481 CGATTCATAC 
541 AATATCAGGG 
601 TAGCAGGATG 
661 TGAGGAATTT 
721 AGATTCAATT 
7 81 AGTGAGCAAG 
841 AAATGGAGTG 
901 TGATCCGAAG 
961 GCTGGCTGGT 
1021 AAAGAAGCCG 
1081 TAGATGGGTA 
1141 GACTATGCGA 
1201 TAAGGTAGCA 
1261 TGGAAGAGAC 
1321 AAAGTCCATA 
1381 AATGTGCCCG 
1441 ACATGGATTT 
1501 GATTTTTGGG 
1561 TTCACCAAAA 

SEQ, ID. NO, 268 

1 MEGTNLTTYA 
61 SLKYGPVMQL 
121 YGPYWRQARK 
181 VISRMVLGKR 
241 KWSKKFDKF 
301 DMLAGGTESS 
361 KETMRLHPVA 
421 HEKSIDVKGH 
481 DEIFGLSTPK 



TABACUM 



CCTCTCCCAA 
TTGACTACAT 
CTTCGCCAGA 
TTAAACCTTA 
CCCGTTATGC 
GCTAAGATTT 
GG AAAATACA 
CGCCAGGCAC 
GAGTATATTC 
AAACCAATTG 
GTACTGGGGA 
AAGAAGATGT 
CCATGGATTG 
AAATTCGACA 
GAGAATTATG 
TTGGAAGTTA 
GGAACCGAGA 
GAGATTTTCA 
CAAGAAAAGG 
CTGCACCCCG 
GGCTACGACG 
CCTACATTGT 
GATGTTAAAG 
GGTTATAGCT 
AACTGGTCAT 
CTCTCTACAC 
CTTTACTCTG 



ATCACTGATT 
ATGCAGCAGT 
GGAAACTCAA 
TTGGCAATCT 
AACTCCAATT 
TCCTCAAATC 
CAACGTACAA 
GTAGGATGTG 
GGGCTGAGGA 
TGTTGAAAGA 
AAAGGTATTT 
TGGACGAATT 
ATTTCATGGA 
AGTTTTTAGA 
TTGCTAAGGA 
AGCTGGAGAG 
GTTCAGCAGT 
AAAAGGCTAC 
ACATTCCAAA 
TGGCACCAAT 
TTCAGAAAGG 
GGGACGAGCC 
GACATGATTA 
TGGGGCTCAA 
TGCCTGATAA 
CTAAAAAATT 
TTTGATTCAG 



CACCACCAAA 
ATTTCTTGAT 
TTTACCTCCA 
TCCTCATCGC 
CGGGTCTTTC 
CATGGATATT 
TTATTCCGAT 
CCTAACGGAA 
GTTGCATTCT 
TTATTTGACG 
GGACGAATCC 
GTTTTTGCTA 
TTTGCAAGGT 
GCATGTTATT 
TATGGTGGAT 
ACATGGAGTC 
GACAGTGGAG 
AGAAGAATTG 
TCATCcTTAC 
GTTGGTGCCA 
AACTAGGGTT 
TGAGGTGTTC 
TGAGCTTTTG 
GGTGATTCAA 
TATGACTCCT 
TCCACTTGCT 
CAGTTCTATG 



AGTACCAACA 
ACTCTGTTTC 
GGCCCAAAAC 
TCAATCCACG 
CCCGTTGTAG 
AACTTTGTAG 
ATTACATGGT 
TTATTCAGCA 
CTTCTCCATA 
ACGTTGAGTT 
GAGAACTCGT 
AATGGTGTAC 
TATGTTAAGA 
GATGAGCATA 
GTTTTGTTGC 
AAAGCATTCA 
TGGGCAATTT 
GATCGAGTAA 
ATAGAGGCAA 
CGTGAGTGTC 
CTCGTGAGTG 
AAGCCGGAGA 
CCATTTGGAG 
GCTAGCTTAG 
GAGGACCTCA 
ACTGTGATTG 
GATCCGTCAA 



ATTCAATGGA 
TTTTGTTCCT 
CATGGCCGAT 
AACTCTCCCT 
TTGGATCCTC 
GCAGGCCTAA 
CTCCTTACGG 
CGAAACGTCT 
ATTTGAACAA 
TAAATGTTAT 
TCGTGAATCC 
TTAATATTGG 
GGATGAAAGT 
ACATTAGGAG 
AGCTTGCTGA 
CTCAGGATAT 
CAGAGCTGCT 
TTGGGCAGAA 
TAGTCAAAGA 
GAGAAGATAT 
TATGGACTAT 
GATTCCATGA 
CGGGGAGAAG 
CTAATCTTCT 
ACATGGATGA 
AGCCAAGACT 
GATAGAC 



AVFLDTLFLL 
QFGSFPVWG 
MCLTELFSTK 
\"LDESENSFV 
LEHVIDEHNI 
AVTVEWAISE 
PMLVPRECRE 
DYELLPFGAG 
KFPLATVIEP 



FLSKLLRQRK 
SSVEMAKIFL 
RLDSYEYIRA 
NPEEFKKMLD 
RRNGVENYVA 
LLKKPEIFKK 
DIKVAGYDVQ 
RRMCPGYSLG 
RLSPKLYSV 



LNLPPGPKPW 
KSMDINFVGR 
EELHSLLHNL 
ELFLLNGVLN 
KDMVDVLLQL 
ATEELDRVIG 
KGTRVLVSVW 
LKVIQASLAN 



PIIGNLNLIG 
PKTAAGKYTT 
NKISGKPIVL 
IGDSIPWIDF 
ADDPKLEVKL 
QNRWVQEKDI 
TIGRDPTLWD 
LLHGFNWSLP 



NLPHRSIHEL 
\TIYSDITWSP 
KDYLTTLSLN 
MDLQGYVKRM 
ERHGVKAFTQ 
PNHPYIEAIV 
EPEVFKPERF 
DNMTPEDLNM 
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NAME D268-AE2 

ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 269 

1 TGCAATATAG TTTTCCTAGT CAGTTCTAGC CTCCTTTTCC TTAGAAATAA TGGATTATCA 

61 TATTTCTTTC CATTTTCAAG CTCTTTTAGG GCTTTTAGCC TTTGTGTTCT TGTCTATTAT 

121 CTTATGGAGA AGAACACTCA CTTCAAGAAA ATTAGCCCCT GAAATCCCAG GGGCATGGCC 

181 TATTATAGGC CATCTTCGTC AGCTGAGTGG TACTGATAAG AATATCCCAT TTCCCCGAAT 

241 ATTGGGCGCT TTGGCAGATA AATATGGACC TGTCTTCACA CTGAGAATAG GGATGTACCC 

301 CTATTTGATT GTCAACAATT GGGAAGCAGC TAAGGATTGT CTCACAACGC ATGATAAGGA 

3 61 CTTCGCTGCC CGACCAACTT CTATGGCTGG TGAAAGCATC GGGTACAAGT ATGCGAGGTT 

421 TACTTATGCT AATTTTGGTC CTTATTATAA CCAAGTGCGC AAACTAGCCC TACAACATGT 

481 ACTCTCGAGT ACTAAACTCG AGAAAATGAA ACACATACGT GTTTCTGAAT TGGAAACTAG 

541 CATCAAAGAA TTATATTCTT TGACGCTGGG CAAAAACAAC ATGCAAAAAG TGAATATAAG 

601 TAAATGGTTT GAACAATTGA CTTTAAACAT AATCGTGAAG ACAATTTGTG GCAAGAGATA 

661 TAGCAACATA GAGGAGGATG AAGAGGCACA ACGTTTCAGA AAGGCATTTA AGGGCATCAT 

721 GTTTGTTGTA GGGCAAATTG TTTTATATGA CGCAATTCCA TTCCCATTGT TCAAATACTT 

781 TGATTTCCAA GGTCATATAC AATTGATGAA CAAAATTTAT AAAGACTTAG ATTCTATTCT 

841 TCAAGGATGG TTGGATGATC ATATGATGAA CAAGGATGTA AACAATAAGG ATCAAGATGC 

901 CATAGATGCC ATGCTTAAGG TAACACAACT TAATGAATTC AAAGCCTATG GTTTTTCTCA 

961 GGCCACTGTG ATCAAGTCGA CAGTCTTGAG TTTGATCTTA GATGGAAATG ACACAACCGC 

1021 TGTTCATTTG ATATGGGTAA TGTCCTTATT ACTGAACAAT CCACATGTTA TGAAACAAGG 

1081 CCAAGAAGAG ATAGACATGA AAGTGGGTAA AGAGAGGTGG ATTGAAGATA CTGACATAAA 

1141 AAATTTAGTG TACCTTCAGG CTATCGTTAA AGAGACATTG CGCTTGTATC CACCTGTTCC 

1201 TTTTCTTTTA CCACACGAAG CAGTGCAAGA TTGTAAAGTG ACTGGTTACC ACATTCCTAA 

12 61 AGGTACTCGT CTATATATCA ATGCGTGGAA AGTACATCGC GATTCTGAAA TTTGGTCAGA 

1321 GCCCGAAAAG TTTATGCCCA ATAGATTCTT GACTAGCAAA GCAAATATAG ATGCTCGCGG 

1381 TCAAAATTTT GAATTTATAC CGTTTGGTTC TGGGAGACGG TCATGTCCAG GGTTAGGTTT 

1441 TGCGACTTTA GTGACACATC TGACTTTTGG TCGCTTGCTT CAAGGTTTTG ATTTTAGTAA 

1501 GCCATCAAAC ACGCCAATTG ACATGACAGA AGGCGTAGGC GTTACTTTGC CTAAGGTTAA 

1561 TCAAGTTGAA GTTCTAATTA CCCCTCGTTT ACCTTCTAAG CTTTATTTAT TTTGAAAGTG 

1621 CAAATCATCA ATCATGGGTT GAGTAATTAG TGATACT 

SEQ, ID. NO. 270 

1 MDYHISFHFQ ALLGLLAFVF LSIILWRRTL TSRKLAPEIP GAWPIIGHLR QLSGTDKNIP 
61 FPRILGALAD FT/GPYFTLRI Gt'IYPrLIVim WEAAKDCLTT HDKDFAARPT SMAGESIGYK 
121 YARFTYANFG PYYNQVRKLA LQHVIiSSTKli EKMKHIRVSE LETSIKELYS LTLGKNNMQK 
181 VNISKWFEQL TLNIIVKTIC GKRYSNIEED EEAQRFRKAF KGIMFWGQI VLYDAIPFPL 
241 FKYFDFQGHI QLMNKIYKDL DSILQGWLDD HMMNKDVNNK DQDAIDAMLK VTQLNEFKAY 
301 GFSQATVIKS TVLSLILDGN DTTAVHLIWV MSLLLNNPHV MKQGQEEIDM KVGKERWIED 
361 TDIKNLVYLQ AIVKETLRLY PPVPFLLPHE AVQDCKVTGY HIPKGTRLYI NAWKVHRDSE 
421 IWSEPEKFMP NRFLTSKANI DARGQNFEFI PFGSGRRSCP GLGFATLVTH LTFGRLLQGF 
481 DFSKPSNTPI DMTEGVGVTL PKVNQVEVLI TPIUiPSKLYL F 
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NAME D283-AC1 
ORGANISM MICOTIANA TABACUM 
SEQ. ID. NO. 271 

1 AGAGAGTGAA AATGGACGCA CTACTTCAAA TGACAGTAAC AGCATCTTGT GCTGCCATAG 
61 TAATTACTCT GCTGGTGTGT ATATGGAGAG TGCTGAACTG GATTTGGTTC AGACCAAAGA 
121 AATTGGAGTT GTTGTTGAGA AAACAAGGTT TGGAAGGAAA TTCTTACAAG GTTTTGTATG 
181 GGGACATGAA AGAGTTTTCT GGGATGATTA AGGAAGCATA CTCAAAGCCT ATGAGTCTAT 
241 CTGATGATGT AGCACCAAGA CTGATGCCTT TCTTTCTTGA AACCATCAAA AAATATGGAA 
301 AAAGATCCTT TATATGGTTT GGTCCAAGAC CACTAGTATT GATTATGGAT CCTGAGCTTA 
361 TAAAGGAAGT ACTCTCAAAA ATCCATCTGT ATCAAAAGCC- TGGTGGAAAT CCATTAGCAA 
421 CACTATTGGT ACAAGGAATA GCAACCTATG AGGAAGACAA ATGGGCCAAA CATAGAAAAA 
481 TCATCAATCC CGCTTTCCAT CTAGAGAAGC TAAAGCTTAT GCTTCCAGCA TTTCGCTTAA 
541 GCTGTAGTGA GATGCTGAGC AAATGGGAAG ACATTGTTTC AGCTGATAGC TCACATGAGA 
601 TAGATGTATG GTCTCACCTT GAGCAATTGA CTTGCGATGT GATCTCTCGG ACAGCTTTTG 
661 GCAGTAGTTA TGAAGAAGGT AGAAAGATTT TTGAACTTCA AAAGGAACAA GCTCAGTATC 
721 TTGTGGAAGT TTTCCGCTCC GTTTATATCC CAGGAAGGAG ATTTTTGCCA ACAAAGAGGA 
781 ATAGAAGAAT GAAGGAAATA AAAAAGGATG TCCGGGCATC AATTAAAGGT ATTATTGATA 
841 AAAGATTGAA GGCAATGAAA GCAGGGGACA CCAATAATGA GGATCTATTG GGTATATTAC 
901 TGGAATCGaA TATTAAAGAA ATTGAACAGC ACGGAAACAA GGATTTTGGA ATGAGCATTG 
961 AAGAAGTCAT TGAAGAATGC AAGTTATTCT ATTTTGCTGG CCAAGAAACT ACATCAGTGT 
1021 TACTCCTATG GTCTCTAGTG TTGTTGAGCA GGTATCAAGA TTGGCAGGCA CGGGCCAGAG 
1081 AAGAAATCTT GCAAGTCTTT GGCAGTCGAA AACCAGATTT TGACGGATTA AATCATCTAA 
1141 AAATTGTGAC AATGATCTTG TACGAGTCTT TAAGGCTGTA TCCCTCACTA ATAACACTTA 
1201 CCCGCCGGTG TAATGAAGAC ATTGTATTAG GAGAACTATC TCTACCAGCT GGTGTTCTAG 
1261 TCTCTTTGCC ATTGATTTTG TTGCATCATG ATGAAGAGAT ATGGGGTGAA GATGCAAAGG 
1321 AGTTCAAACC AGAGAGATTT AGAGAAGGAA TATCAAGTGC AACAAAGGGT CAACTCACAT 
1381 ATTTTCCATT TAGCTGGGGT CCTAGAATAT GTATTGGACA AAATTTTGCC ATGTTAGAAG 
1441 CAAAGATGGC TCTGTCTATG ATCCTGCAAC GCTTCTCTTT TGAACTGTCT CCGTCTTATG 
1501 CACATGCCCC TCGGTCCATA ATAACCGTTC AGCCTCAGTA TGGTGCTCCA CTTATTTTCC 
1561 ACAAACTATA ATTTTGGTAC TTCTACTAAT ATTTTAGGGT TTATTCAGAC TCAAAAAAAA 

SEQ. ID. NO. 272 

1 MTVTASCAAI VITLLVCIWR V1->IWIV;FRPK KLELLLRKQG LEGNSYKVLY GDMKEFSGMI 

61 KEAYSKPMSL SDDVAPRLMP FFLETIKKYG KRSFIWFGPR PLVLIMDPEL IKEVLSKIHL, 

121 YQKPGGNPLA TLLVQGIATY EEDKWAKHRK IINPAFHLEK LKLMLPAFRL SCSEMLSKWE 

181 DIVSADSSHE IDWSHLEQL TCDVISRTAF GSSYEEGRKI FELQKEQAQY LVEVFRSVYI 

241 PGRRFLPTKR NRRMKEIKKD VRASIKGIID KRLKAMKAGD TNNEDLLGIL LESNIKEIEQ 

301 HGNKDFGMSI EEVIEECKLF YFAGQETTSV LLLWSLVLLS RYQDWQARAR EEILQVFGSR 

3 61 KPDFDGLNHL KIVTMILYES LRLYPSLITL TRRCNEDIVI* GELSLPAGVL VSLPLILLHH 

421 DEEIWGEDAK EFKPERFREG ISSATKGQLT YFPFSWGPRI CIGQNFAMLE AKMALSMILQ 
481 RFSFELSPSY AHAPRSIITV QPQYGAPLIF HKL 
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NAME D244-AB6 
ORGANISM NICOTIANA TABACUM 
SEQ, ID. NO. 273 

1 TGCAATATAG TTTTCCTAGT CAGTTCTAGC CTCCTTTTCC TTAGAAATAA TGGATTATCA 
61 TATTTCTTTC CATTTTCAAG CTCTTTTAGG GCTTTTAGCC TTTGTGTTCT TGTCTATTAT 
121 CTTATGGAGA AGAACACTCA CTTCAAGAAA ATTAGCCCCT GAAATCCCAG GGGCATGGCC 
181 TATTATAGGC CATCTTCGTC AGCTGAGTGG TACTGATAAG AATATCCCAT TTCCCCGAAT 
241 ATTGGGCGCT TTGGCAGATA AATATGGACC TGTCTTCACA CTGAGAATAG GGATGTACCC 
301 CTATTTGATT GTCAACAATT GGGAAGCAGC TAAGGATTGT CTCACAACGC ATGATAAGGA 
361 CTTggCTGCC CGACCAACTT CTATGGCTGG TGAAAGCATC GGGTACAAGT ATGCGAGGTT 
421 TACTTATGCT AATTTTGGTC CTTATTATAA CCAAGTGCGC AAACTAGCCC TACAACATGT 
481 ACTCTCGAGT ACTAAACTCG AGAAAATGAA ACACATACGT GTTTCTGAAT TGGAAACTAG 
541 CATCAAAGAA TTATATTCTT TGACGCTGGG CAAAAACAAC ATGCAAAAAG TGAATATAAG 
601 TAAATGGTTT GAACAATTGA CTTTAA^^CAT AATCGTGAAG ACAATTTGTG GCAAGAGATA 
661 TAGCAACATA GAGGAGGATG AAGAGGCACA ACGTTTCAGA AAGGCATTTA AGGGCATCAT 
721 GTTTGTTGTA GGGCAAATTG TTTTATATGA CGCAATTCCA TTCCCATTGT TCAAATACTT 
781 TGATTTCCAA GGTCATATAC AATTGATGAA CAAAATTTAT AAAGACTTAG ATTCTATTCT 
841 TCAAGGATGG TTGGATGATC ATATGATGAA CAAGGATGTA AACAATAAGG ATCAAa^TGC 
901 CATAGATGCC ATGCTTAAGG TAACACAACT TAATGAATTC AAAGCCTATG GTTTTTCTCA 
961 GGCCACTGTG ATCAAGTCGA CAGTCTTGAG TTTGATCTTA GATGGAAATG ACACAACCGC 
1021 TGTTCATTTG ATATGGGTAA TGTCCTTATT ACTGAACAAT CCACATGTTA TGAAACAAGG 
1081 CCAAGAAGAG ATAGACATGA AAGTGGGTAA AGAGAGGTGG ATTGAAGATA CTGACATAAA 
1141 AAATTTAGTG TACCTTCAGG CTATCGTTAA AGAGACATTG CGCTTGTATC CACCTGTTCC 
1201 TTTTCTTTTA CCACACGAAG GAGTGCAAGA TTGTAAAGTG ACTGGTTACC ACATTCCTAA 
1261 AGGTACTCGT CTATATATCA ATGCGTGGAA AGTACATCGC GATCCTGAAA TTTGGTCAGA 
1321 GCCCGAAAAG TTTATGCCCA ATAGATTCTT GACTAGCAAA GCAAATATAG ATGCTCGCGG 
1381 TCAAAATTTT GAATTTATAC CGTTTGGTTC TGGGAGACGG TCATGTCCAG GGATAGGTTT 
1441 TGCGACTTTA GTGACACATC TGACTTTTGG TCGCTTGCTT CAAGGTTTTG ATTTTAGTAA 
1501 GCCATCAAAC ACGCCAATTG ACATGACAGA AGGCGTAGGC GTTACTTTGC CTAAGGTTAA 
1561 TCAAGTTGAA GTTCTAATTA CCCCTCGTTT ACCTTCTAAG CTTTATTTAT TTTGAAGGTG 
1621 CAAATCATCA ATCATGGCTT GAGTAATTAG TTATACTTTA ATATGTTTCT C 



SEQ. ID. NO. 274 

1 MDYHISFHFQ ALLGLLAFVF LSIILWRRTL TSRKLAPEIP GAWPIIGHLR QLSGTDKNIP 

61 FPRILGALAD KYGPVFTLRI GMYPYLIVNN WEAAKDCLTT HDKDLAARPT SMAGESIGYK 

121 YARFTYANFG PYYT^QVRKLA LQHVLSSTKL EKMKHIRVSE LETSIKELYS LTLGKNNMQK 

181 VNISKWFEQL TU^JIIVKTIC GKRYSNIEED EEAQRFRKAF KGIMFWGQI VLYDAIPFPL 

241 FKYFDFQGHI QLMNKIYKDL DSILQGWLDD Hl^-l^TKDVlWK DQDAIDAMLK VTQLNEFKAY 

301 GFSQATVIKS TVLSLILDGN DTTAVHLIWV MSLLLNNPHV I4KQGQEEIDM KVGKERWIED 

361 TDIKNLVYLQ AIVKETLRLY PPVPFLLPHE AVQDCKVTGY HIPKGTRLYI NAWKVHRDPE 

421 IWSEPEKFMP NRFLTSKANI DARGQNFEFI PFGSGFIRSCP GIGFATLVTH LTFGRLLQGF 

481 DFSKPSMTPI DMTEGVGVTL PKVNQVEVLI TPRLPSKLYL F 
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NAME D205-BE9 
ORGANISM NICOTIANA TABACUM 
SEQ. ID, NO. 275 

1 TTTGATTCAA CCATGGAGAA CCAATACTCC TACTCATTCT CTTCCTACTT CTACTTAGCT 
61 ATAGTACTGT TTCTTCTTCC AATTTTGGTC AAATATTTCT TCCATCGGAG AAGAAATTTA 
121 CCTCCAAGTC CATTTTCTCT TCCAATAATT GGTCACCTTT ACCTTCTCAA GAAAACTCTC 
181 CATCTCACTC TAACATCCTT ATCAGCTAAA TATGGTCCTG TTTTATACCT CAAATTGGGC 
241 TCTATGCCTG TGATTGTTGT GTCCTCACCA TCTGCTGTTG AAGAATGTTT AACCAAGAAT 
301 GATATCATAT TCGCAAATAG GCCCAAGACC GTGGCTGGTG ACAAGTTTAC CTACAATTAT 
361 ACTGTTTATG TTTGGGCACC CTATGGCCAA CTTTGGAGAA TTCTTCGCCG ATTAACTGTC 
421 GTTGAACTCT TCTCTTCACA TAGCCTACAG AAAACTTCTA TCCTTAGAGA TCAAGAAGTT 
481 GCAATATTTA TCCGTTCGTT ATACAAATTC TCAAAGGATA GTAGCAAAAA AGTCGATTTG 
541 ACCAACTGGT CTTTTACTTT GGTTTTCAAT CTTATGACCA AAATTATTGC TGGGAGACAT 
601 ATTGTGAAGG AGGAAGATGC TGGCAAGGAA AAGGGCATTG AAATTATTGA AAAACTTAGA 
661 GGGACTTTCT TAGTAACTAC ATCATTCTTG AATATGTGTG ATTTCTTGCC AGTATTCAGG 
721 TGGGTTGGTT ACAAAGGGCA GGAGAAGAAG ATGGCCTCAA TTCACAATAG AAGAAATGAA 
781 TTCTTGAACA GCTTGCTTGA TGAATTTCGA CACAAGAAAA GTAGTGCTTC ACAATCTAAC 

841 acaactgttg gaaacatgga gaagaaaacc acactgattg aaaagctctt gtctcttcaa 
901 gaatcagagc ctgaattcta cactgatgat atcatcaaaa gtattatgct ggtagttttt 
961 gttgcaggaa cagagacctc atcaacaacc atccaatggg taatgaggct tcttgtagct 
1021 caccctgagg cattgtataa gctacgagct gacattgaca gtaaagttgg gaataagcgc 
1081 ttgctgaatg aatcagacct caacaagctt ccgtatttgc attgtgttgt taatgagaca 
1141 atgagattat acactccgat accactttta ttgcctcatt attcaactaa agattgtatt 
1201 gtgga;^.ggat atgatgtacc aaaacataca atgttgtttg tcaacgcttg ggccattcac 
1261 agggatccca aggtatggga ggagcctgac aagttcaagc cagagagatt tgaggcaaca 
1321 gaaggggaaa cagaaaggtt caattacaag cttgtaccat ttggaatggg gagaagagcg 
1381 tgccctggag ctgatatggg gttgcgagca gtttctttgg cattaggtgc acttattcaa 
1441 tgctttgact ggcaaattga ggaagcggaa agcttggagg aaagctataa ttctagaatg 
1501 actatgcaga acaagccttt gaaggttgtc tgcactccac gcgaagatct tggccagctt 
1561 ctatcccaac tctaaggcaa tttatcaatg ccaaacgtaa tcttcatcta ccactatg 

SEQ. ID. NO. 27 6 

1 MENQYSYSFS SYFYLAIVLF LLPILVTCfFF HRRRNLPPSP FSLPIIGHLY LLKKTLHLTL 
61 TSLSAKYGPV LYLKLGSMPV IWSSPSAVE ECLTKNDIIF ANRPKTVAGD KFTYNYTVYV 
121 WAPYGQLWRI LRRLTWELF SSHSLQKTSI LRDQEVAIFI RSLYKFSKDS SKKVDLTNWS 
181 FTLVFl^MTK IIAGRHIVKE EDAGKEKGIE IIEKLRGTFL VTTSFLNMCD FLPVFRWVGY 
241 KGQEKKMASI HNRRNEFLNS LLDEFRHKKS SASQSNTTVG NMEKKTTLIE KLLSLQESEP 
3 01 EFYTDDIIKS IMLWFVAGT ETSSTTIQWV imLLVAHPEA LYKLRADIDS KVGI^TKRLLNE 
3 61 SDLNKLPYLH CW^TETMRLY TPIPLLLPHY STKDCIVEGV DVPKHTMLFV NAWAIHRDPK 
421 WEEPDKFKP ERFEATEGET ERFNYKLVPF GMGRRACPGA DMGLRAVSLA LGALIQCFDW 
481 QIEEAESLEE SYNSRMTMQN KPLKWCTPR EDLGQLLSQL 



79/141 



wo 2005/038033 



PCT/US2004/034065 



NAME D136-AF4 
ORGANISM NICOTIANA TABACUM 
SEQ. ID, NO, 277 

1 CCTTTTTAAG ATGTATTTAA GATTTAAGAT TTAAGATGAA GCAACTGAGG TAAGTCCTTT 
61 CAAGGAGTAG TTGTCACTTC TGAGAATGGA GATGATGTAC AGCATAATAG CAGCAGCCAG 
121 TATTGCAATT ATCTTGGTAT ATACATGGAA AGTGTTGAAT TGGGCTTGGT TTGGGCCGAA 
181 GAAAATGGAG AAATGCTTAA GACAGAGGGG TCTCAAGGGA AATCCTTATA AGCTACTCTA 

2 41 TGGAGATCTA AACGAACTGA CAAAAAGCAT AATAGAAGCC AAGTCTAAGC CCATCAAriT 
301 CTCTGATGAT ATTGCTCAAA GGCTC ATCCC TTTTTTTCTT GACGCCATCA ACAAAAATGG 

3 61 TAAAAACTCC TTCGTCTGGC TTGGACCGTA TCCAATAGTG TTGATCACGG ATCCTGAGCA 
421 TTTAAAGGAG ATTTTCACAA AGAATTATGT GTATCAAAAG CAAACTCATC CCAATCCATA 
481 CGCCAAGCTA TTAGCTCACG GTCTTGTCAG CCTTGAGGAA GACAAATGGG CCAAACACAG 
541 AAAAATCATT AGTCCTGCCT TCCATGTCGA GAAGCTAAAG CATATGCTGC CTGCATTTTA 
601 TCTGAGTTGT AGTGAAATGA TAAGCAAATG GGAGGAGGTT GTTCCAAAAG AAACATCATT 

6 61 CGAGCTCGAT GTATGGCCAG ACCTTCAAAT AATGACCAGT GAAGTCATTT CTCGCACTGC 
721 ATTTGGGAGT AGCTATGAAG AAGGAAGAAT AGTATTTGAA CTTCAGAAAG AACAAGCTGA 

7 81 GTATGTAATG GACATAGGAC GTTCAATTTA TATACCAGGA TCA.\GGTTCT TGCCTACTAA 
841 AAGGAACAAA AGAATGCTGG AAATTGAAAA GCAAGTGCAA ACAACAATTA GGCGTATCAT 
901 CGACAAAAGA TTGAAGGCAA TGGAAGAAGG GGAGACTAGT AAAGATGACT TATTAGGCAT 
961 ATTACTTGAA TCCAATTTGA AAGAAATTGA ACTTCATGGA AGAAATGACT TGGGAATAAC 

1021 AACGTCAGAA GTGATTGAAG AGTGCAAGTT ATTCTATTTT GCCGGCCAAG AGACCACTTC 
1081 AGTGTTGCTT GTTTGGACAA TGATTTTGTT GTGCTTACAT CCAGAGTGGC AAGTACGTGC 
1141 CAGAAAGGAA GTGTTGCAGA TCTTTGGAAA TGATAAACCA GATTTGGAAG GACTAAGTCG 
1201 CTTGAAAATT GTAACAATGA TCTTGTACGA GACGTTACGC CTATTCCCCC CATTACCAGC 

12 61 ATTTGGTAGA AGGAACAAAG AAGAAGTCAA ATT AGGGGAG CTACATCTAC CGGCTGGAGT 

13 21 GTTACTCGTT ATACCAGCAA TCTTAGTACA TTATGATAAG GAAATATGGG GTGAAGATGC 
1381 AAAGGAATTC AAACCAGAAA GATTCAGTGA AGGAGTGTCA AAGGCAACAA ATGGACAAGT 
1441 CTCATTTATA CCATTTAGCT GGGGACCTCG TGTTTGCATT GGACAAAACT TCGCAATGAT 
1501 GGAAGCAAAA ATGGCAGTAA CTATGATACT ACAAAAATTC TCCTTTGAAC TATCCCCTTC 
1561 TTATACACAT GCTCCATTTG CAATTGTGAC TATTCATCCC CAGTATGGTG CTCCTCTGCT 
1621 TATGCGCAGA CTTTAAAACA TATGTTGCTG ATATTTAAGA TCAGTGGCGT TTTATT 

SEQ. ID. NO. 278 

1 MEMMYSIIAA ASIAIILVYT WKVUWAWFG PKKMEKCLRQ RGLKGNPTKL LYGDLNELTK 

61 SIIEAKSKPI NFSDDIAQRli IPFFLDAINK NGKNSFWLG PYPIVLITDP EHLKEIFTKN 

121 YVYQKQTHPN PYAKLLAHGL VSLEEDKWAK HRKIISPAFH VEKLKHMLPA FYLSCSEMIS 

181 KWEEWPKET SFELDVWPDL QIMTSEVISR TAFGSSYEEG RIVFELQKEQ AEYVl^IGRS 

241 lYIPGSRFLP TKRNKRMLEI EKQVQTTIRR IIDKRLIC\ME EGETSKDDLL GILLESNLKE 

3 01 lELHGRNPLG ITTSEVIEEC KLFYFAGQET TSVLLVWTMI LLCLHPEWQV RARKEVLQIF 

361 G^TDKPDLEGL SRLKIVTMIL YETLRLFPPL PAFGRRNKEE ^TO^GELMLPA GVLLVIPAIL 

421 VHYDKEIWGE DAKEFKPERF SEGVSKATNG QVSFIPFSVJG PRVCIGQNFA MMEAKMAVTM 

481 ILQKFSFELS PSYTHAPFAI VTIHPQYGAP LLMRRL 
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NAME D101-BA2 

ORGANISM NICOTIANS TABACUM 

SEQ. ID. NO. 279 



1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 



CTAAATTTCA 
CCCATAGAAG 
AAAAAACAAT 
ATCGGCCATC 
CTCGGTGACT 
TTCTTGGCGG 
TTCGCCGATC 
GCTGTTGCCA 
CTCTCCGTTA 
ATTAAACAAT 
GATAAATTGA 
CATGGAGAAA 
CTCTATGATG 
GCTATGAAAC 
ATTAAGAAGA 
GTGCTTTCCA 
AAAGCAACTG 
TGGGTAATGG 
GACACAATTG 
CTCCAAGCAA 
CACCTATCTG 
CTTACCAATA 
GATCCAGAGA 
TTGATCCCCT 
GAACACCTTT 
CCTTTGGATA 
CTAATTACCC 



TATACCTTTA 
CCATTGTAGG 
CAAAAATCTT 
TCTTTTATTT 
TAGCTGACAA 
TGAGTAGTTA 
GGCCATCTTT 
AATATGGCCC 
GTCGGCTCGA 
TGTATAATTG 
CATTCGACAT 
TACTCAAAGT 
TTTTTCACAT 
AAACTTTCAA 
GAGAAACAAA 
AGATGAGCGA 
TATTCACTTT 
CGTT.AATGAT 
TTGGTAGAGA 
TTGTTAAAGA 
TGGAAGATTG 
TTATGAAACT 
GATTCCTGAC 
TTGGTACGGG 
CAATTGCTCA 
TGAAACAAGG 
CTCGTTT 



GTACTCTTGA 
ATTTGTAACC 
AAACCCACTA 
CAAGAACAAT 
ATATGGTCCC 
TGAAGCTATG 
ACTCTACGGA 
TTACTGGAAA 
AAAATTCAAA 
TGATTCACCA 
CATTTTGAAA 
TGCTTTTCAG 
TCCATTTTTC 
AGACATTGAT 
GGATGTTGGA 
CGAACATCTT 
GGTCTTGGAT 
AAACAATAAG 
TAGATGGGTA 
ACT ATT AC GA 
TGTTGTCAAT 
ACAGCGAGAT 
GACTCATGCT 
GAGACGAGCT 
TATGATCCAA 
TGTGGGTTTA 



AATTTTCAAA 
TTTTCATTTC 
CCTCCAAAAA 
GGCGATGAAG 
GTCTTCACTT 
AAAGAATGCT 
GAATACCTTT 
AAAAATCGAA 
CATGTTAGAT 
ATGGTGAAGA 
ATGGTTGTTG 
AAATTCATGG 
AAGTGGTTGG 
AATATTATCC 
GGTGAAAACG 
GGCGAGGGTT 
GCAACAGACA 
CATGTCATGA 
GAAGAGAGTG 
TTACATCCAC 
GGGTACCATA 
CCTCAAACAT 
ACTATTGACT 
TGTCCCGCGA 
GGTTTCAGTT 
ACTTTACCAA 



TAATGGTTTA 
TATTCTACTT 
TCCCAGGTGG 
ATCGCCATTT 
TCCGGTTAGG 
TCACTACCAA 
GCTATAATAA 
AGTTAGTCAA 
TTTCTATAAT 
TAAACCTTAG 
GG AAGAACTA 
TTCAAGCTAT 
ATCTTACAGG 
AAGGTTGGTT 
AACAAGATTT 
ACTCTCAXaA 
CACTTGCACT 
AGAAAGCACA 
ATATCAAGAA 
CTGCACCTTT 
TTCCTAAGGG 
GGCCAAATCC 
ACCGCGGGCA 
TGAATTATTC 
TTGCAACTAC 
AGAAGACTGA 



TCTTCTTTCT 
TCTATGGACC 
ATGGCCAGTA 
TTCTCAAAAA 
GTTTCGCCGT 
TGATATCCAT 
TGCCATGCTT 
TCAAGAAGTT 
TCAGAAAAAT 
TGATTGGATA 
TAATAATGGA 
GGAGATGGAG 
GAATATTAAG 
AGATGAGCAC 
TATAGATGTG 
CACAACCATC 
TCATATAAAG 
AGAAGAGATG 
TTTGGTGTAT 
GTCAGTGCAA 
GACTGCACTA 
TGATAAATTC 
GCACTATGAG 
ATTGCAAGTG 
GACCAATGAG 
TGTTGAAGTT 



SEQ. ID, NO. 280 

1 MVYLLSPIEA IVGFVTFSFL FYFLWTKKQS KILNPLPPKI PGGWPVIGHL FYFKNNGDED 
61 RHFSQKLGDL ADKYGPVFTF RLGFRRFLAV SSYEAHKECF TTNDIHFADR PSLLYGEYLC 
121 YNNAMIaAVAK YGPYWKKNRK LVNQEVLSVS RLEKFKHVRF SIIQKNIKQL YNCDSPMVKI 
181 MLSDWIDKLT FDIILKMWG KNYNNGHGEI LKVAFQKFMV QAMEMELYDV FHIPFFKWLD 
241 LTGNIKAMKQ TFKDIDNIIQ GWLDEHIKKR ETKDVGGENE QDFIDWliSK MSDEHLGEGY 
301 SHDTTIKATV FTLVLDATDT LALHIKWVMA LMINNKHVMK KAQEEMDTIV GRDRWVEESD 
361 IKNLVYLQAI VKEVLRLHPP APLSVQHLSV EDCWNGYHI PKGTALLTNI MKLQRDPQTW 
421 PNPDKFDPER FLTTHATIDY RGQHYELIPF GTGRRACPAM NYSLQVEHLS lAHMIQGFSF 
481 ATTTNEPLDM KQGVGLTLPK KTDVEVLITP R 
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NAME D130-AA1 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO, 281 

1 CTTTTTCTCC CCAA^AAAGA GCTCATTTCC CTTGTCCCCA A^iAATGGATC TTCTCTTACT 
61 AGAGAAGACC TTAATTGGTC TCTTCTTTGC CATTTTAATC GCTGTA?^TTG TCTCTAGACT 
121 TCGTTCAAAG CGTTTTAAGC TTCCCCCAGG ACCAATCCCA GTACCAGTTT TTGGTAATTG 
181 GCTTCAAGTT GGTGATGATT TAAACCACAG AAATCTTACT GATTTTGCCA AAAAATTTGG 
241 TGATCTTTTC TTGTTAAGAA TGGGCCAGCG TAATTTAGTT GTTGTGTCAT CTCCTGAATT 
301 AGCTAAAGAA GTTTTACACA CACAAGGTGT TGAATTTGGT TCAAGAACAA GAAATGTTGT 
361 ATTTGATATT TTTACTGGAA AAGGTCAAGA TATGGTTTTT ACTGTATATG GTGAACACTG 
421 GAGAAAAATG AGGAGAATTA TGACTGTACC ATTTTTTACT AATAAAGTTG TGCAGCAATA 
481 TAGAGGGGGG TGGGAGTTTG AAGTGGCAAG TGTAATTGAG GATGTGAAGA AAAATCCTGA 
541 ATCTGCTACT AATGGGATTG TATNAAGGAG GAGATTACAA TTGATGATGT ATAATAATAT 
601 GTTTAGGATT ATGTTTGATA GGAGATTTGA GAGTGAAGAT GATCCTTTGT TTGTTAAGCT 
6 61 TAAGGCTTTG AATGGTGAAA GGAGTAGATT GGCTCAGAGT TTTGAGTATA ATTATGGTGA 
721 TTTTATTCCC ATTTTGAGGC CTTTTTTGAG AGGTTATTTG AAGATCTGTA AAGA^^GTTAA 
781 GGAGAAGAGG CTGCAGCTTT TCAAAGATTA CTTTGTTGAT GAAAGAAAGA AGCTTTCAAA 
841 TACCAAGAGC TTGGACAGCA ATGCTCTGAA ATGTGCGATT GATCACATTC TTGAGGCTCA 
901 ACAGAAGGGG GAGATCAATG AGGACAACGT TCTTTACATT GTTGAAAACA TCAATGTTGC 
961 TGCTATAGAA ACCACATTAT GGTCAATTGA GTGGGGTATC GCCGAGTTAG TCAACCACCC 
1021 TCACATCCAA AAGAAACTCC GCGACGAGAT TGACACAGTT CTTGGCCCAG GAGTGCAAGT 
1081 GACTGAACCA GACACCCACA AGCTTCCATA CCTTCAGGCT GTGATCAAGG AGACGCTTCG 
1141 TCTCCGTATG GCAATTCCTC TATTAGTCCC ACACATGAAC CTTCACGATG CAAAGCTTGG 
12 01 CGGGTTTGAT ATTCCAGCAG AGAGCAAAAT CTTGGTTAAC GCTTGGTGGC TAGCTAACAA 
1261 CCCGGCTCAT TGGAAGAAAC CCGAAGAGTT CAGACCCGAG AGGTTCTTCG AAGAGGAGAA 
1321 GCACGTTGAG GCCAATGGCA ATGACTTCAG ATATCTTCCG TTTGGCGTTG GTAGGAGGAG 
1381 TTGCCCTGGA ACTATACTTG CATTGCCAAT TCTTGGCATT ACTTTGGGAC GTTT 

SEQ, ID, NO. 282 

1 MDLLLLEKTIi IGLFFAILIA VIVSRLRSKR FKLPPGPIPV PVFGtWLQVG DDLNHRNLTD 
61 FAKKFGDLFL LRWGQRNLW VSSPELAKEV LHTQGVEFGS RTRNWFDIF TGKGQPMVFT 
121 VYGEHWRKMR RIMTVPFFTN KWQQYRGGW EFEVASVIED VKKNPESATN GIVLRRRLQL 
181 MMYX^NMFRIM FDRRFESEDD PLFVKLKALN GERSRLAQSF EYITYGDFIPI LRPFLRGYI^K 

2 41 ICKEVKEKRL QLFKDYFVDE RKKLSNTKSL DSNALKCAID HILEAQQKGE INEDNVLYIV 
301 ENINVAAIET TLWSIEWGIA ELVNHPHIQK KLRDEIDTVD GPGVQVTEPD THKLPYLQAV 

3 61 IKETLRLRMA IPLLVPHMNL HDAKLGGFDI PAESKILVNA WWLANNPAHW KKPEEFRPER 
421 FFEEEKHVEA NGNDFRYLPF GVGRRSCPGT ILALPILGIT LGR 
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NAl^E D13 6-AD5 

ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 283 

1 CCAAATTAGA GCAAGAAATT AACAAGTCTA GTTACCTTCT CCCTTTTTAA GAGTATTTAA 
61 GATTTAAGAT TTAAGATGAA GCAACTGAGG TAAGTCCTTT CAAGGAGTAG TTGTCACTTC 
121 TGAGAATGGA GATGATGTAC AGCATAATAG CAGCAGCCAG TATTGCAATT ATCTTGGTAT 
181 ATACATGGAA AGTGTTGAAT TGGGCTTGGT TTGGGCCAAA GAAAATGGAG AAATGCTTAA 
241 GACAGAGGGG TCTCAAGGGA AATCCTTATA AGCTACTCTA TGGAGATCTA AACGAACTGA 
301 CAAAAAGCAT AATAGAAGCC AAGTCTAAGC CCATCAATTT CTCTGATGAT ATTGCTCAAA 
3.61 GGCTCATCCC TTTTTTTCTT GACGCCATCA ACAAAAATGG TAAAAACTCC TTCGTCTGGC 
421 TTGGACCGTA TCCAATAGTG TTaATCACGG ATCCTGAGCA TTTAAAGGAG ATTTTCACAA 
4 81 AGAATTATGT GTATCAAAAG CAAACTCATC CCAATCCATA CGCCAAGCTA TTAGCTCACG 
541 GTCTTGTCAG CCTTGAGGAA GACAAATGGG CCAAACACAG AAAAATCATT AGTCCTGCCT 
601 TCCATGTCGA GAAGCTAAAG CATATGCTGC CTGCATTTTA TCTGAGTTGT AGTGAAATGA 
661 TAAGCAAATG GGAGGAGGTT GTTCCAAAAG AAACATCATT CGAGCTCGAT GTATGGCCAG 
721 ACCTTCAAAT AATGACCAGT GAAGTCATTT CTCGCACTGC ATTTGGGAGT AGCTATGAAG 
781 AAGGAAGAAT AGTATTTGAA CTTCAG-ZIAAG AACAAGCTGA GTATGTAATG GACATAGGAC 
841 GTTCAATTTA TATACCAGGA TCAAGGTTCT TGCCTACTAA AAGGAACAAA AGAATGCTGG 
901 AAATTGAAAA GC AAGTGCAA ACAACAATTA GGCGTATCAT CG AC AAAAGA TTGAAGGCAA 
961 TGGAAGAAGG GGAGACTAGT AAAGATGACT TATTAGGCAT ATTACTTGAA TCCAATTTGA 
1021 AAGAAATTGA ACTTCATGGA AGAAATGACT TGGGAATAAC AACATCAGAA GTGATTGAAG 
1081 AGTGCAAGTT AATCTATTTT GCCGGCCAAG AGACCACTTC AGTGTTGCTT GTTTGGACAA 
1141 TGATTTTGTT GTGCTTACAT CCAGAGTGGC AAGTACGTGC CAGAAAGGAA GTGTTGCAGA 
1201 CCTTTGGAAA TGATAAACCA GATTTGGAAG GACTAAGTCG CTTGAAAATT GTAACAATGA 
1261 TCTTGTACGA GACGTTACGC CTATTCCCCC CATTACCAGC ATTTGGTAGA AGGAACAAAG 
1321 AAGAAGTCAA ATTAGGGGAG CTACATCTAC CGGCTGGAGT GTTACTCGTT ATACCAGCAA 
1381 TCTTAGTACA TTATGATAAG GAAATATGGG GTGAAGATGC AAAGGAATTC AAACCAGAAA 
1441 GATTCAGTGA AGGAGTGTCA AAGGCAACAA ATGGACAAGT CTCATTTATA CCATTTAGCT 
1501 AGGGACCTCG TGTTTGCATT GGACAAAACT TCGCAATGAT GCAAGCAAAA ATGGCAGTAA 
1561 CTATGATACT ACAAAAATTC TCCTTTGAAC TATCCCCTTC TTATACACAT GCTCCATTTG 
1621 CAATTGTGAC TATTCATCCC CAGTATGGTG CTCCTCTGCT TATGCGCAGA CTTTAAAACA 
1681 TATGTTGCTG ATATTTAAGA TCAGTGGCGT TTTATTCTCC ATG 



SEQ. ID. NO, 284 

1 MEI'IMYSIIAA ASIAIILVYT WKVLNWAWFG 

61 SIIEAKSKPI NFSDDIAQRL IPFFLDAINK 

121 YAA'QKQTHPN PYAKLLAHGL VSLEEDKWAK 

181 KWEEWPKET SFELDVWPDL QIMTSEVISR 

241 lYIPGSRFLP TKRNKRMLEI EKQVQTTIRR 

301 lELHGRNDLG ITTSEVIEEC KLIYFAGQET 

3 61 GNDKPDLEGL SRLKIVTMIL YETLRLFPPL 

421 VHYmEIWGE DAKEFKPERF SEGVSKATNG 



PKKMEKCLRQ RGLKGNPYKL LYGDLNELTK 
NGKNSFVWLG PYPIVLITDP EHLKEIFTKN 
HRKIISPAFH VEKLKHMLPA FYLSCSEMIS 
TAFGSSYEEG RIVFELQKEQ AEYVMDIGRS 
IIDKRLKAME EGETSKDDLL GILLESNLKE 
TSVLLVWTMI LLCLHPEWQV RARKEVLQTF 
PAFGRRNKEE VKLGELHLPA GVLLVIPAIL 
QVSFIPFS 
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NAME D138-AD12 

ORGANISM NICOTIANA TABACUM 
SEQ. ID* NO. 285 

1 TTTGCCTTTG CTCGTCATTG ATGACGACTT CATTTTGTTT TCTTCCCCAC GAAAATGGTA 

61 GATATGATAT GGAGGGACGT AGGGAAGAAT TACTGGGACA AACCTAGTGA GTGAAAATGG 

121 AAACAGTTGA AATGATAGTA AAAGTATCTT GTGCTGCCAT AGTAATTACT CTGTTGGTGT 

181 GTCTATGGAG AGTGCTGAAT TGGGT'rTGGT TCAGACCAAA GAAATTAGAG AAGTTGTTGA 

241 GAAAACAGGT TTTGTATGGG GACATGAAAG AGTTTTCTGG GATGATTAAG GAAGCATACT 

301 CAAAGCCTAT GAGTCTGTCT GATGATGTAG CACCACGAAT GATGCCTTTC TTTCTTGAAA 

361 CCATCAAGAA ATATGGAAAA AGATCCTTTA TATGGTTCGG TCCAAGACCA CTAGTATTGA 

421 TCATGGATCC TGAGCTTATA AAGGAAGTAC TCTCCAAAAT CTATCTTTAT CAAAAGCCCG 

481 GTGGAAATCC ATTAGCAACA CTATTGGTAC AAGGATTAGC AACCTATGAG GAAGACAAAT 

541 GGGCC AAACA TAGAAAAATC ATCAATCCCG CTTTCCATCT AGAGAAGCTA AAGCATATGC 

601 TTCCAGCTTT TCGCTTGAGC TGTAGTGAGA TGCTGAGCAA ATGGGAAGAC ATTGTTTCAG 

661 CTGAAGGCTC ACATGAGATA GATGTATGGC CTAACCTTGA GCAATTGAGT TGCGATGTGA 

721 TCTCTCGGAC AGCTTrTGGC AATAGTTATG AAGAAGGTAG AAAGATTTTT GAACTTCAAA 

781 AGGAACAAAC TCAGCATCTT GTGGAAGCTT TCCGCTCTGT TTATATCCCA GGAAGGAGAT 

841 TTTTGCCAAC AAAGAGGAAT AGAAGAATGA AGGAAATAAA AAAGGAGGTT CGAGCGTCAA 

901 TTAAAGGTAT TATTGATAAA AGATTGAAGG CAATGAAAGC AGGGGACACC AATAATGAGG 

961 ATCTATTGGG ATATTGCTGG AATCAAATTT TAAAGAAATT GAACAGCGCG GAAACAAGGA 

1021 TTTTGGAATG AGCATTGAAG ATGTCATTGA AGAATGCAAG TTATTCTATT TTGCTGGCCA 

1081 AGAAACTACA TCAGTGTTGC TCCTATGGTC TCTAGTGTCG TTGAGCAGGT ATCAAGATTG 

1141 GCAGACACGG GCCAGAGAAG AAGTCTTGCA TGTCTTTGGG AGTCGGAAAC CAGATTTTGA 

1201 TGAATTAAAT CATCTAAAAG TTGTGACAAT GATCATGTAC GAGTCTTTAA GGCTATATCC 

12 61 CTCACTAATA ACACTTACCC GCCGGTGTAA TGAAGACATT GTATTAGGAG AACTATCTCT 
1321 ACCAGCTGGT GTCCTAGTCT CTTTGCCAAT GATTTTGTTG CATCATGATG AAGAGATATG 

13 81 GGGTGAAGAT GCAAAGGAGT TCAAACCAGA GAGATTTAGA GAAGGATTGT CAAGTGCAAC 
1441 AAAGGGTCAA CTTACATATT TTCCATTTGG CTGGGGTCCT AGAATATGTA TTGGACAAAA 
1501 TTTTGCCATG TTAGAAGCAA AGATGGCTCT GTCTATGATC CTGCAACGCT TCTCTTTTGA 
1561 ACTGTCTCCG TCTTATGCAC ATGCCCCTCA GTCCATATTA ACCGTTCAGC CTCAATATGG 
1621 TGCTCCACTT ATTTTCCACA AGCTATAATT TGGTACTTGT GAAAGGTGTC TTGTACAATA 
1681 TGTTAGTAGA GTTTATTCAG ACTTAGATAC ATGCTTC 

SEQ. ID. NO. 286 

1 METVEMIVKV SCAAIVITLL VCLWRVLNWV WFRPKKLEKL LRKQVLYGDM KEFSGMIKEA 

61 YSKPMSLSDD VAPRMMPFFL ETIKKYGKRS FIVJFGPRPLV LIMDPELIKE VLSKIYLYQK 

121 PGGNPLATLL VQGLATYEED KWAKHRKIIN PAFHLEKLKH I'lLPAFRLSCS EMLSKWEDIV 

181 SAEGSHEIDV WPNLEQLSCD VISRTAFGNS YEEGRKIFEL QKEQTQHLVE AFRSVYIPGR 

241 RFLPTKRNRR MKEIKKBVRA SIKGIIDKRL KAMKAGDTNN EDLLGYCWNQ ILKKLNSAET 

301 RILE 
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NAME D216-AG8 

ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 287 

1 CCAAAATGCA GTTCTTCAAC TTCATTTCCT TTGTCTTTTT TGTGTCTTTC CTCTTTTTAT 

61 TAAGGAAATG GAAGAACTCC AATAGCCAAA CCAAAAGATT GCCTCCAGGT CCATGGAAAT 

121 TACCTGTACT TGGAAGCATG TTTCATTTGC TAGGTGGACC TCCACATCAT GTCCTTGGAG 

181 ATTTAGCCAA AAAATATGGT CCACTTATGC ACCTTCAACT AGGTGAAGTT TCTGTAGTTT 

241 CTGTTACTTC TCCTa^GATG GCPJiAAGAAG TACTAAAAAC TCATGACCTC GCTTTTGCAT 

3 01 CTAGGCCGTT ACTTTTGGCA GCCA-AAATTG TCTGCTATAA TGGGACAGAC ATTGTCTTTT 

361 CCCCCTATGG CGATTATTGG AGACAAACGC GTAAAATTTG TCTCTTGGAA TTGCTCAGTG 

421 CCAAAAATGT TAGGTCATTC AGCTCAGTCA GACGAGATGA AGTTTTCCAT ATGATTGAAT 

481 TTTTTTCGAT CATCTTCTGG TAAGCCAGTT AATGTATCAA AAAGGATTTC TCTATTCACA 

541 ACCTCTATGA CATGTAGATC AGCCTTTGGA CAAGAATACA AGGAGCAAGA CGAATTCGCA 

601 CAACTAGTAA AAAAAGTGTC AAGCTTAATG GAAGGGTTTG ATGTTGCTGA TATATTCCCT 

661 TCATTGAAGT TTCTTCATGT GCTCAGTGGA ATGAAGGCTA AAGTTATGGA TGCACACCAT 

721 GAGTTAGATG CCATTCTTGA AAAAATTATC AATGAGCACA AGAAAATTGC AACTGGAAAG 

781 AATAATAATG AATTAGGAGG TGA.^GGATTA ATTGACGTAC TGCTAAGACT TATGAAAGAG 

841 GGAGGCCTTC AATTCCCGAT CACCAACGAC AACATCAAAG CTATTATTTT TGACATGTTT 

901 GGTGCGGGAA CGGAAACTTC ATCAACCACA ATTGACTGGG CCATGGTCGA AATGATAAAG 

961 AATCCAAGTG TATTCGCTAA AGCTCAAGCA GAGGTAAGAG AAGCCTTCAG AGAGAAAGAA 

1021 ACTTTTGATG AAAATGATGT CGAGGAGTTG AAATACTTAA AATTGGTTAT CAAAGAAACT 

1081 TTCAGACTCC ATCCTCCATT TCCCCTTTTG CTCCCAAGAG AATCTAGAGA AGAAACAGAT 

1141 ATAAACGGCT ACACTATTCC TTTTAAAACA AAACTTATGG TTAACGTTCG GGCTATTGGA 

1201 AGAGATCCAA AATATTGGGA TGACGTGGAA AGTTTTAAGC CAGAGAGATT TGAGCACAAC 

12 61 TCTATGGATT TTATTGGTAA TAATTTTGAA TATCTTCCCT TTGGTAGTGG AAGGAGAATG 

1321 TGCCCTGGGA TATCATTTGG TTTGGCTAAT GTTTATTTGC CACTAGCTCA ATTGTTATAT 

1381 CATTTTGATT GGAAACTCCC TACTGGAATC AATTCAAGTG ACTTGGACAT GACTGAGTCG 

1441 TCAGGAGTAA CTTGTGCTAG AAAGAGTGAT TTATACTTGA CTGCTACTCC ATATCAACTT 

1501 TCTCAAGAGT GATGCAATGA TATCAACCTT TTGAATTTCG GTCAACCCCA CCAATAGTG 

SEQ. ID. NO. 2 88 

1 MQFFNFISFV FFVSFLFLLR KWKNSNSQTK RLPPGPWKLP VLGSMFHLLG GPPHHVX>GDL 

61 AKKYGPLMHli QLGEVSWSV TSPEMAKEVL KTHDLAFASR PLLUAAKIVC YNGTDIVFSP 

121 YGDYWRQTRK ICLLELLSAK NVRSFSSVRR DEVFHMIEFF SIIFW 
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NAME D243-AB3 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 2 89 

1 CCCCACCAAA AAATCATTTC TCTCGTCTAA AATGGATCTT CTCTTACTAG AGAAGACCTT 
61 AATTGGTCTT TTCTTTGCCA TTTTAATCGC TTTAATTGTC TCTAAACTTC GTTCAAAGCG 
121 TTTTAAGCTT CCTCCAGGAC CAATTCCAGT ACCAGTTTTT GGTAATTGGC TTCAAGTTGG 
181 TGATGATTTA AACCACAGAA ATCTTACTGA TTATGCCAAG AAATTTGGAG ATCTTTTCTT 
241 GTTAAGAATG GGTCAACGTA ACTTAGTTGT TGTGTCATCT CCTGAATTAG CTAAAGAAGT 
3 01 TTTACACACA CAAGGTGTTG AATTTGGTTC AAGAACAAGA AATGTTGTGT TTGATATTTT 

3 61 TACTGGAAAA GGTCAAGATA TGGTTTTTAC TGTATATGGT GAACATTGGA GAAAAATGAG 
421 GAGAATTATG ACTGTACCAT TTTTTACTAA TAAAGTTGTG CAACAGTATA GAGGGGGGTG 

4 81 GGAGTTTGAG GTGGCAAGTG TAATTGAGGA TGTGAAAAAA AATCCTGAAT CTGCTACTAA 
541 TGGGATCGTA TTAAGGAGGA GATTACAATT AATGATGTAT AATAATATGT TTAGGATTAT 
601 GTTTGATAGG AGATTTGAGA GTGAAGATGA TCCTTTGTTT GTTAAGCTTA AGGCTTTaAA 
661 TGGTGAAAGG AGTAGATTGG CTCAAAGTTT TGAGTATAAT TATGGTGATT TTATTCCAAT 
721 TTTGAGGCCT TTTTTTGAGA GGTTATTTGA AGATCTGTAA AGAAGTTAAG GAGAAGAGGC 
781 TGCAGCTTTT CAAAGATTAC TTTGTTGATG AAAGAAAGAA GCTTTCGAAT ACCAAGAGCT 
841 CGGACAGCAA TGCCCTAAAA TGTGCGATTG ATCACATTCT TGAGGCTCAA CAGAAGGGAG 
901 AGATCAATGA GG AC AACGTT CTTTACATTG TTGAAAACAT CAATGTTGCT GCAATTGAAA 
961 CAACATTATG GTCAATTGAG TGGGGTATCG CCGAGCTAGT CAACCACCCT CACATCCAAA 

1021 AGAAACTGCG CGACGAGATT GACACAGTTC TTGGACCAGG AGTGCAAGTG ACTGAACCAG 
1081 ACACCCACAA GCTTCCATAC CTTCAGGCTG TGATCAAGGA GGCACTTCGT CTCCGTATGG 
1141 CAATTCCTCT ATTAGTCCCA CACATGAACC TTCACGACGC AAAGCTTGGC GGGTTTGATA 
1201 TTCCAGCAGA GAGCAAAATC TTGGTTAACG CTTGGTGGTT AGCTAACAAC CCGGCTCATT 
1261 GGAAGAAACC CGAAGAGTTC AGACCCGAGA GGTTCTTTGA AGAGGAGAAG CATGTTGAGG 
1321 CCAATGGCAA TGACTTCAGA TATCTTCCGT TTGGCGTTGG TAGGAGGAGC TGCCCTGGAA 
1381 TTATACTTGC ATTGCCAACT CTTGGCATCA CTTTGGGACG TTTGGTTCAG AACTTTGAGC 
1441 TGTTGCCTCC TCCAGGCCAG TCGAAGCTCG ACACCACAGA GAAAGGTGGA CAGTTCAGTC 
1501 TCCACATTTT GAAGCATTCC ACCATTGTGT TGAAACCAAG GTCTTTCTGA ACTTTGTGAT 
1561 CTTATTAATT AAGGGGTTCT GAAGAAATTT GATAGTGTTG G 

SEQ. ID. NO. 2 90 

1 MDLLLLEKTL IGLFFAILIA LIVSKLRSKR FKLPPGPIPV PVFGNWLQVG DDL^JHRNLTD 
61 YAKKFGDLFL LRMGQR^JLW VSS PELAKEV LHTQGVEFGS RTR^JVVFDIF TGKGQDMVFT 
121 VYGEHWRKMR RIMTVPFFTN KWQQYRGGW EFEVASVIED VKKNPESATN GIVLRRRLQL 
181 M14YNNMFRIM FDRRFESEDD PLFVKLKALN GERSRLAQSF EYNYGDFIPI LRPFFERLFE 
241 DL 
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NAME D205-AH4 
ORGANISM NICOTIANA 
SEQ. ID. NO. 291 

1 GTGAGGTTTG 
61 AAAATATTGT 
121 TTCATTTCAT 
181 TCATTCTCTT 
241 TATTTCTTCC 
3 01 CACCTTTACC 
3 61 GGTCCTGTTT 
421 GCTGTTGAAG 
481 GCTGGTGACA 
541 TGGAGAATTC 
601 ACTTCTATCC 
661 AAGGATAGTA 
721 ATGACCAAAA 
781 GGCATTGAAA 
841 ATGTGTGATT 
901 GCCTCAATTC 
961 AAGAAAAGTA 
1021 CTGATTGAAA 
10 81 ATCAAAAGTA 
1141 CAATGGGTAA 
12 01 ATTGACAGTA 

12 61 TATTTGCATT 

13 21 CCTCATTATT 

13 81 TTGTTTGTCA 

14 41 TTCAAGCCAG 

15 01 GTACCATTTG 
1561 TCTTTGGCAT 
1621 TTGGAGGAAA 

16 81 ACTCCACGCG 
1741 AACGTAATCT 

SEQ. ID. NO. 2 92 

1 MENQYSYSFS 
61 TSLSAJGl'GPV 
121 WAPYGQLWRI 
181 FTLVFNLMTK 
241 KGLEKKMASI 
3 01 EFYTDDIIKS 
3 61 SDLNKLPYLH 
421 VWEEPDKFKP 
481 QIEEAESLEE 



TABACUM 

AATCCTCTGC 
CCAAAGCTAA 
AGCTCAATCT 
CCTACTTCTA 
ATCGGAGAAG 
TTCTCAAGAA 
TATACCTCAA 
AATGTTTAAC 
AGTTTACCTA 
TTCGCCGATT 
TTAGAGATCA 
GCAAAAAAGT 
TTATTGCTGG 
TTATTGAAAA 
TCTTGCCAGT 
ACAATAGAAG 
GTGCTTCACA 
AGCTCTTGTC 
TTATGCTGGT 
TGAGGCTTCT 
AAGTTGGGAA 
GTGTTGTTAA 
CAACTAAAGA 
ACGCTTGGGC 
AGAGATTTGA 
GAATGGGGAG 
TAGGTGCACT 
GCTATAATTC 
AAGATCTTGG 
TCATCTACCA 



CTCAATGAAA 
AGATTCTTTC 
TTCTTATTTT 
CTTAGCTATA 
AAATTTACCT 
AACTCTCCAT 
ATTGGGCTCT 
CAAGAATGAT 
CAATTATACT 
AACTGTCGTT 
AGAAGTTGCA 
CGATTTGACC 
GAGACATATT 
ACTTAGAGGG 
ATTCAGGTGG 
AAATGAATTC 
ATCTAACACA 
TCTTCAAGAA 
AGTTTTTGTT 
TGTAGCTCAC 
TAAGCGCTTG 
TGAGACAATG 
TTGTATTGTG 
CATTCACAGG 
GGCAACAGAA 
AAGAGCGTGC 
TATTCAATGC 
TAGAATGACT 
CCAGCTTCTA 
CTATG 



CTCACCAAAT 
TCCTTAAATA 
GATTCAACCA 
GTACTGTTTC 
CCAAGTCCAT 
CTCACTCTAA 
ATGCCTGTGA 
ATCATATTCG 
GTTTATGTTT 
GAACTCTTCT 
ATATTTATCC 
AACTGGTCTT 
GTGAAGGAGG 
ACTTTCTTAG 
GTTGGTTACA 
TTGAACAGCT 
ACTGTTGGAA 
TCAGAGCCTG 
GCAGGAACAG 
CCTGAGGCAT 
CTGAATGAAT 
AGATTATACA 
GAAGGATATG 
GATCCCAAGG 
GGGGAAACAG 
CCTGGAGCTG 
TTTGACTGGC 
ATGCAGAACA 
TCCCAACTCT 



TGGTTTTCTA 
GTCAACTTTA 
TGGAGAACCA 
TTCTTCCAAT 
TTTCTCTTCC 
CATCCTTATC 
TTGTTGTGTC 
CAAATAGGCC 
GGGCACCCTA 
CTTCACATAC 
GTTCGTTATA 
TTACTTTGGT 
AAGATGCTGG 
TAACTACATC 
AAGGGCTGGA 
TGCTTGATGA 
ACATGGAGAA 
AATTCTACAC 
AGACCTCATC 
TGTATAAGCT 
CAGACCTCAA 
CTCCGATACC 
ATGTACCAAA 
TATGGGAGGA 
AAAGGTTCAA 
ATATGGGGTT 
AAATTGAGGA 
AGCCTTTGAA 
AAGGCAATTT 



ATTTCCATCT 
GTGGTTCCTC 
ATACTCCTAC 
TTTGGTCAAA 
AATAATTGGT 
AGCTAAATAT 
CTCACCATCT 
CAAGACCGTG, 
TGGCCAACTT 
CCTACAGAAA 
CAAATTCTCA 
TTTCAATCTT 
CAAGGAAAAG 
ATTCTTGAAT 
GAAGAAGATG 
ATTTCGACAC 
GAAAACCACA 
TGATGATATC 
AACAACCATC 
ACGAGCTGAC 
CAAGCTTCCG 
ACTTTTATTG 
ACATACAATG 
GCCTGACAAG 
TTACAAGCTT 
GCGAGCAGTT 
AGCGGAAAGC 
GGTTGTCTGC 
ATCAATGCCA 



SYFYLAIVLF 
LYLKLGSMPV 
LRRLTWELF 
IIAGRHIVKE 
HNRRNEFLNS 
IMLWFVAGT 
CWNETMRLY 
ERFEATEGET 
SYNSRMTMQN 



LLPILVKYFF 
IWSSPSA\^ 
SSHSLQKTSI 
EDAGKEKGIE 
LLDEFRHKKS 
ETSSTTIQWV 
TPIPLLLPHY 
ERFNYKLVPF 
KPLKWCTPR 



HRRRNLPPSP 
ECLTKNDIIF 
LRDQEVAIFI 
IIEKLRGTFL 
SASQSNTTVG 
MRLLVAHPEA 
STKDCIVEGY 
GMGRRACPGA 
EDLGQLLSQL 



FSLPIIGHLY 
ANRPKTVAGD 
RSLYKFSKDS 
VTTSFLNMCD 
NMEKKTTLIE 
LYKLRADIDS 
DVPKHTMLFV 
DMGLRAVSLA 



LLKKTLHLTL 
KFTYNY'TVYV 
SKKVDLTNWS 
FLPVFRWVGY 
KLLSLQESEP 
KVGNKRLLNE 
NAWAIHRDPK 
LGALIQCFDW 
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NAME D267-AF10 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 2 93 

1 AACATCCTTT CCTTCTTCCA AAAATGGAGC TTCAATCTTC TCCTTTCAAT TTAATTTCTT 
61 TGTTCCTCTT CTTTTCTTTT CTTTTTATTC TAGTGAAGAA ATGGAATGCC AAAATCCCAA 
121 AGTTACCTCC AGGTCCGTGG AGGCTTCCCT TTATTGGAAG CCTCCATCAC TTGAAGGGAA 
181 AACTTCCACA CCATAATCTT AGAGATCTAG CGCGAAAATA TGGACCTCTC ATGTACTTAC 
241 AACTCGGAGA AATTCCTGTA GTTGTAATAT CTTCGCCACG TGTAGCAAAA GCTGTACTAA 
3 01 AAACTCATGA TCTCGCTTTT GCAACTAGAC CACGATTCAT GTCCTCAGAC ATTGTGTTTT 

3 61 ACAAAAGCAG GGACATCTCT TTTGCCCCAT TTGGTGATTA CTGGAGACAG ATGCGTAAAA 
421 TATTGACTCA GGAACTCCTG AGCAACAAGA TGCTCAAGTC ATATAGCTTA ATCCGAAAGG 

4 81 ATGAGCTCTC GAAGCTCCTC TCATCGATTC GTTTGGAAAC AGGTTCTGCA GTGAACATAA 
541 ATGAAAAGCT TCTCTGGTTT ACGAGCTGCA TGACCTGTAG ATTAGCCTTT GG AAAAATAT 
601 GCAATGATCG GGATGAGTTG ATCATGCTAA TTAGGGAGAT ATTAACATTA TCAGGAGGAT 
661 TTGATGTGGG TGATTTGTTC CCTTCCTGGA AATTACTTCA TAATATGAGC AACATGAAAG 
721 CTAGGTTGAC GAATGTACAC CACAAGTATG ATTTAGTTAT GGAGAACATC ATCAATGAGC 
7 81 ACCAAGAGAA TCATGCAGCA GGGATAAAGG GTAACAACGA GTTTGGTGGC GAAGATATGA 
841 TCGATGCTCT ACTGAGGGCT AAGGAG.Su^TA ATGAGCTTCA ATTTCCTATC GAAAATGACA 
901 ACATGAAAGC AGTAATTCTG GACTTGTTTA TTGCTGGAAC TGAAACTTCA TATACTGCAA 
961 TTATATGGGC ACTATCAGAA TTGATGAAGC ACCCAAGTGT GATGGCCAAG GCACAAGCTG 

1021 AAGTGAGAAA AGTCTTCAAA GAAAATGAAA ATTTCGACGA AAATGATCTT GACAAGTTGC 
1081 CATACCTAAA ATCAGTGATT AAAGAAACAC TAAGGATGCA CCCTCCAGTT CCTTTGTTAG 
1141 GGCCTAGAGA ATGCAGGGAC CAAACAGAGA TCGATGGCTA CACTGTACCT ATTAAAGCTA 
1201 GAGTTATGGT TAATGCTTGG GCGATAGGAA GAGATCCTGA AAGTTGGGAA GATCCTGAAA 

12 61 GTTTCAAACC GGAGCG ATTT GAAAATACTT CTGTTGATCT TACAGGAAAT CACTATCAGT 

13 21 TCATTCCTTT CGGTTCAGGA AGAAGAATGT GTCCAGG AAT GTCGTTTGGT TTAGTTAACA 

13 81 CAGGGCATCC TTTAGCCCAG TTGCTCTATT GCTTTGACTG GAAACTCCCT GACAAGGTTA 

14 41 ATGCAAATGA TTTTCGCACT ACTGAAACAA GTAGAGTTTT TGCAGCAAGC AAAGATGACC 

15 01 TCTACTTGAT TCCCACAAAT CACAGGGAGC AAGAATAGCT TAATTTAATG GAGTTCTTGG 
1561 AAGAATTAAA GAAGAAGGGC TATATAGGTG AGATTTTTTG TATGGTTGCA AGGTTTTTAG 
1621 TTCATACAAT AAGACAATAC ATTATATTCC AGTATTGTGT ATCATGTATA ATAAGGTTCC 
1681 TTTTGTTTAA AAAA 



SEQ. ID. NO. 294 

1 MELQSSPFNL ISLFLFFSFL FILVKKWNAK 

61 DLARKYGPLM YLQLGEIPW VISSPRVAKA 

121 APFGDYWRQM RKILTQELLS NKMLKSYSLI 

181 SCMTCRLAFG KICl^RDELI MLIREILTLS 

241 KYDLVMENII NEHQENHAAG IKGNNEFGGE 

3 01 LFIAGTETSY TAIIWALSEL MKHPSVMAKA 

3 61 ETLRMHPPVP LLGPRECRDQ TEIDGYTVPI 

421 NTSVDLTGNH YQFIPFGSGR RMCPGMSFGL 

481 ETSRVFAASK DDLYLIPTNH REQE 



IPKLPPGPWR LPFIGSLHHL KGKLPHHNLR 
VLKTHDLAFA TRPRFMSSDI VFYKSRDISF 
RKDELSKLLS SIRLETGSAV NINEKLLWFT 
GGFDVGDLFP SWKLLHNMSN MKARLTNVHH 
DMIDALLRAK ENNELQFPIE NDNMKAVILD 
QAEVRKVFKE NENFDENDLD KLPYLKSVIK 
KARVMVNAWA IGRDPESWED PESFKPERFE 
VNTGHPLAQL LYCFDWKLPD KVNANDFRTT 
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NAME D284-AH5 
ORGANISM NICOTIANA TABACUM 
SEQ. ID. NO. 295 

1 CAATCAGTGG ATGCGGGAGT AATATATAAT ATGCAAGTTG TAGAAAGAGA AAAAAAAAAT 
61 CAAGTAGCTA TTCTATACTG GGGCACAAAT AGTGAGTGAA AATGGAGACT GTTCAAATCA 
121 TAATAACAGC ATCTTGTGCT GCCATAATAA TTACTCTAGT GGTGTGTATT TGGAGAGTAC 
181 TGAATTGGGT TTGGTTCAGA CCAAAGAAGC TGGAAAAACT ATTGAGGAAA CAAGGTCTCA 
241 AAGGCAACTC CTACAAGATT TTGTATGGGG ATATGAAGGA GCTTTCTGGT ATGATTAAGG 
3 01 AAGCTAATTC CAAACCCATG AATCTTTCTG ATGATATTGC ACCAAGATTG GTGCCTTTCT 

3 61 TTCTTGACAC CATCAAGAAA TATGGTAAAA AATCCTTTGT ATGGTTAGGT CCGAAACCAC 
421 TGGTTCTTAT CATGGACCCT GAGCTTATAA AGGAAATATT TTCCAAATAC TATCTGTATC 

4 81 AAAAGCCTCA TGGAAATCCA GTTACCAAGC TATTAGTACA AGGACTAGTA AGCCTAGAGG 
541 AAGACAAATG GGCCAAACAT AGAAAAATCA TCAATCCAGC TTTCCATCTA GAGAAGCTAA 
601 AGCATATGCT TCCAGCTTTT TGCTTGAGCT GCACTGAGAT GCTGTGCAAA TGGGAAGATA 

6 61 TTGTTTCAAT TAAGGGCTCA CATGAGATAG ATGTATGGCC TCACCTTGAA CAATTAAGTA 
721 GCGATGTGAT CTCTCGGACA GCTTTTGGCA GTAACTTTGA AGAAGGTAAA AGGATATTTG 

7 81 AACTTCAGAA GGAACAAGCT CAGTATTTTG TAGAAGCTAT ACGCTCGGTT TATATACCAG 
841 GCTGGAGGTT TTTGCCAACA AAGAGGAACA GAAGAATGAA GGAAGTTGAA AAGGATGTTC 
901 GGGCCTCGAT AAGAGGCATT ATTGATAAAA GAGTGAAGGC AATGAAAGCA GGAGAGGCGA 
961 GTAATGAGGA TCTACTTGGT ATATTGTTGG AATCTAATTT TACAGAAGCT GAACAGCATA 

1021 GACACAAGGA TTCTGCGATG AGCATTGAAG AAGTCATTCA AGAATGCAAG TTATTCTATG 
10 81 TTGCTGGCCA AGAAACTACA TCAGTGTTGC TTGTGTGGAC TCTAATATTG TTGAGTAGGC 
1141 ATCAAGATTG GGAGAGGCGA GCCAGAGAAG AGGTGTTTCA AGTCTTTGGT AATCAGAAAC 
12 01 CAGATTTTGA CGGATTGAAT CGTCTAAAAG TTGTGACAAT GATCTTGTAT GAGTCTTTAA 

12 61 GGCTATACTC CCCAGTAGTG TCACTAATCC GGCGGCCTAA TGAGGATGCT ATATTAGGAA 

13 21 ATGTATCTCT GCCAGAAGGT GTGCTACTCT CATTACCAGT GATCTTATTA CACCACGATG 
13 81 AAGAGATATG GGGTAAAGAT GCAAAGAAGT TCAATCCAGA AAGATTTAGA GATGGAGTCT 
1441 CAAGTGCAAC AAAGGGTCAA GTCACTTTTT TTCCATTTAC TTGGGGTCCC AGAATATGCA 
1501 TCGGACAAAA TTTTGCCATG TTAGAAGCAA AGACTGCTTT GGCTATGATC CTACAACGCT 
1561 TCTCATTCGA ACTGTCTCCA TCTTATGCAC ATGCTCCTCA GTCCATATTA ACTATGCAAC 
1621 CCCAACATGG TGCTCCACTA ATTCTGCACA AAATATAGTT TGTTACTTTA AGCAGTGTCT 
16 81 TGTTATATGT CAGAGAGTCC AAAATGTTTA ATTAAGGCTT GTAGAACTGC CAAATGGAAC 
1741 TTCATTTGCA TTCGTGGGTT GTAGATTGTT GTAATTGGAC AAGTATACTG TTTATTTTAG 
1801 AGTTTTAAGA AAAAAAAAA 



SEQ. ID. NO. 2 96 

1 METVQIIITA SCAAIIITLV VCIWRVLNWV WFRPKKLEKL LRKQGLKGNS YKILYGDMKE 

61 LSGMIKEANS KPMNLSDDIA PRLVPFFLDT IKKYGKKSFV WLGPKPLVLI MDPELIKEIF 

121 SKYYLYQKPH GNPVTKLLVQ GLVSLEEDKW AKHRKIINPA FHLEKLKHML PAFCLSCTEM 

181 LCKWEDIVSI KGSHEIDVWP HLEQLSSDVI SRTAFGSNFE EGKRIFELQK EQAQYFVEAI 

241 RSVYIPGWRF LPTKRNRRMK EVEKDVRASI RGIIDKRVKA MKAGEASNED LLGILLESNF 

301 TEAEQHRHKD SAMSIEEVIQ ECKLFYVAGQ ETTSVLLVWT LILLSRHQDW QSRAREEVFQ 

361 VFGNQKPDFD GLNRLKWTM ILYESLRLYS PWSLIRRPN EDAILGNVSL PEGVLLSLPV 

421 ILLHHDEEIW GKDAKKFNPE RFRDGVSSAT KGQVTFFPFT WGPRICIGQN FAMLEAKTAL 

481 AMILQRFSFE LSPSYAHAPQ SILTMQPQHG APLILHKI 
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Figure 149: Amino Acid Identity of Group Members 



Group 1 

AQIiAINIiVTSMLGHLriHHPTWAPAPGVNPBDIDLEESPGTVTYMKNPIQ^ 

I 

AQLAIWLVTSMLGHLLHHPTWAPPPGVNPEMIDLEESPGTVTYMKNPIQAIPTPIUjPAHLYGRV 



SEQ ID No. :2 D58-BG7 
(98,5) 

SEQ ID No. :4 D58-AB1 



Group 2 

QIiAINIiVTSMLGHLFIILHGLRPRGLTRRIIjTWRRALEQ 



SEQ ID No. :8 DS8-BE4 



Group 3 

EGrlAVRMViU^SLGCrrQCFDWQRIGEELVDMTEGTGtlTLPKAQPLVAKCSPRPK^^^ 

I II I 

BGIiAIRMVALSLGCIIQCFDWQRLGEGLVDKTEGTGLTLPKAQPLVAKCSPRPIMAmiLSQI 



SEQ ID No.: 10 D56-AH7 
(93.5) 

SEQ ID No.:12 D13a-5 



Group 4 

IQFATIiVTHIiTFQRliLQGFDFSKPSNTPIDNrrEGVGVTLPKVNQVEVLITPRIiPSKIiYriF 

I I II 

IMFATLVTHLTPGRLLQGPDPSTPSirrPIDMTEGVGWLPKTNQV^IVrilSPRLPSKLYVP 



SEQ ID No.:14 D56-AG10 
(93.3) 

SEQ ID No. ;18 D34-62 



Group 5 

XIIiAIiPILGITLGRLVQNFELLPPPGQSKLDTTEKGGQFSLHIIiKHSTIVLKPRSF 

I 

rriiAIiPILGITLGRLVQNPEIiLPPPGQSKIiDTTEKGGQPSLHILKHSTIVMKPRSP 

I 1 

rilAIiPILGITLGRIiVQNPEliLPPPGQSKLDTTEKGGQFSIiHirjKHSTIVIjKPRSC 



SEQ ID No.;20 D56-AA7 
{98.2) 

SEQ ID No,:144 D185-BD3 
(96.4) 

SEQ ID No.:22 D56-AE1 



Group 6 

lAIiGVASMBliALSNIiLYAPDWEIiPFGMKKJBDIDTNARPGITMHKKHSLYIiI 

II I I 

lALGVASMBLALSNLLYAPDWELPYGVKOINIDTNWPGITMHKKHELCLIPRNYL 

I I 

rAIJGVASMBIlALlSNLLYAFDWELPYGVKI□EDIDTNVRPGIA^lHKi^ 

III 

rALGVASMEIALSNIiLYAPDWELPYGVKKBDIDTNWPGIAMHKKNBLCLVPK^ 



SEQ ID No.:24 D35-BB7 
(92.8) 

SEQ ID No.:2€ D177-BA7 
(96.4) 

SEQ ID No.:28 D56A-AB6 
(94.6) 

SEQ ID No.:30 D144-AB2 



Group 7 

ISFGLANAYLPLAQIiLYHFDWBLPTQIKPSDLDIjTELVGVTAARKSDDYLVATPYQPPQN 

III I 
ISPGLANAYLPLAQIiLYHFDWKLPAGIBPSDLDLTELVGVTAARKSDtiYLVATPYQPPQK 



SEQ ID No.t32 D56-AG11 
(93.3) 

SEQ ID No-:34 D179-AA1 



Group 8 

MLFGXiANVGQPLAQLLYHFDWBCLPNGQSHENFDMTESPGISATRIODDLVXilATPYDSY 

I ! I II 

MLFGIiANVGQPIiAQLIiYHFDWKLPNGQTHQNFDMTESPGISATRKDDLIIilATPAHS 



SEQ ID NO.:36 D56-AC7 
(91.2) 

SEQ ID No,;38 D144-AD1 
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Group 9 

LLFGLVNVGHPLAQLLYHFDWKTLPGrSSDSFDMTETDGVTAGRIODDLCLIATPFGI^ 

Group 10 

MSFGLATlTXGHPLiAQLLYPFDWKFPHKVNAADFHTTETSRVFAASiaDDLyLIPT^^ 

I M I I I 

MSFGLVNTGHPI^QLLYCFDWKLPDKVNANDPRTTETSRWAMKDDLYLIPTNHREQE 



SEQ ID No. :40 D144-AB5 



SEQ ID No. :42 D181-AB5 
(89.8) 

SEQ ID No. :44 D73-Ac9 



Group 11 

MQFGLALVTJLPLAHLLHNFDWKLPBGINARDLDMTEANGISARREKDLYLIATPYVSP^ 



SEQ ID lTo.:46 D56-AC12 



Group 12 

MTYALQVEHLTMAHLIQGFNYRTPTDEPLDMKEGAGITIRKWPVKVIITPRLAPELY 

III II I 

MTYALQVEHIiTMAHLIQGFlNlYKTPNDEALDMKEGAGITIRKVNPV^LIIAPRIi^ 

I 

MTYALQVEHLTMAHL IQGFNYKTPNDEALDMKEGAGITIRKVNPVELI ITPRIiAPELY 

I I I 

MTYALQVEHLTMAHL IQGFNYRTPNDEPLDMKEGAGITIRKVNPVBLI I APRLAPELY 

I 

MTYALQVEHLTMAHLIQGFimiTPMDEPLDMKEGAGITIRKVNPVSLIIAP-LAPELY 

I 

MTYALQVEHLTMAHLIQQFlTYRTPl^EPLDMKEGAGITIRKVNPABljIIAPRLAPELY 

II II lllll 

MTYALQVEHLTIAHLIQGFiryKTPNDBPLDMKBQAaLTIRKVNPVEVTTTARIiA^ 

I 

MTYALQVEHLTIAHLIQGFirYKTPNDEPLDMKEGAGLTIRKVNP'^/BVTlTARLAPELY 



SEQ ID No.:4S D58-AB9 
(89.6) 

SEQ ID No.:50 D56~AG9 
(98.2) 

SEQ ID No.: 52 D56-AG6 
(94.8) 

SEQ ID No.:54 D35-BG11 
(98.3) 

SEQ ID NO. r56 D3 5-42 
(98.3) 

SEQ ID No.: 58 D35-BA3 
(84,5) 

SEQ ID No. :60 D34-57 
(98-3) 

SEQ ID No.:62 D34-52 



Group 13 

YSLGLKVIRVTIA^mriHGFNWKLPEGMKPBDISVEEHYGLTTHPICFPVPVILESRLSSDLYSPIT 



SEQ ID No.:66 D56-AD10 



Group 14 

YSLGIRIIRATIJy^fLLHGFNWRLPNGMSPEDISMEEIYGLrTHPKVALDVMMEPRLPNHLYK 



SEQ ID Mo. :6a D56-AA11 



Group 15 

INFS I PLVEIiALANLLFHYTWSLPEGMLAKIDVDMEEALG IT^mKK3PLCLV 

I II 
INFSIPLVELAIANLLFHYNWSLPEGMLPKDVDMEEALGITMHKKSPLCLVASHYNLL 



SEQ ID No.;70 D177-BD5 
(94.7) 

SEQ ID No.: 84 D177-BD7 



Group 16 

MQLGLYALEMAVAHLUjCFTWBIiPDaMKPSELKMDDIPGIiTAPRANRtiVAVPSPRLIiCPIiY 

I I 
MQLGLYALEMAVAHLLHCPTWELPDGMKPSELKMDDIFGLTAPRANRIiVAVPTPRLLCPL 

I 

MQLGLYALEMAVAHLI^CFTWELPDGMKPSELKMDDIPGLTAPKANRLVAVPTPRLIiCPIiY 



SEQ ID No.:74 D50-BC5 
(96.7) 

SEQ ID No.:76 D58-AD12 
(98.4) 

SBQ ID No.: 72 D56A-AG10 



Group 17 

MLWSASIVRVSYLTCIYRFQVYAGSVFRVA 



SEQ ID MO,:78 D56-AC11 
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MliWSASIVRVSYLTCIYRFQVYAGSVSRVA 



(96.7) 

SEQ ID No.:88 D56-AD6P 



Group 18 

LNPAMLBAKMALALILQHyAPELSPSYAHAPHTltTLQPQHOAPLILRKIi 



SEQ ID No.: SO D73A-AD6 



Group 19 

QNFAILEAKmiAMILQRFSFELSPSYTHSPrrVVTLKPKYGAPLIMHRL 

I II II MM Ml M 

QNfFAMLEAKMALSMILQRFSFELSPSYAHAPQSIliTVQPQYGAPLIFHKIj 

III! I III I 

IMFAMTEAKMAMAMILQRFSFELSPSYTHAPQSVITMQPQYGAPLILHKL 

INFAMAEAKMAMAMILQRFSPELSPSYTHAPQSVITMQPQYGAPLILHKLi 

I I I II Mil II MM 

QNFAMMEAKMAVA^^IriHKFSFELSPSYTHAPFAIVTIHPQYaAPliIJMRRI- 

I 

QNFAMMEAKMAVAMILQK^'SFELSPSYTHAPFAIVTIHPQYGAPIjLMRRIj 



SEQ ID No.: 96 D70A-AB5 

(72.0) 

SEQ ID No.:100 D70A-AB8 
(82 .0) 

SEQ ID NO.:102 D70A-BH2 
08. 0) 

SEQ ID NO.:104 D70A-AA4 
(70.0) 

SEQ ID No. -.108 D70A-BA9 
(98.0) 

SEQ ID No.:106 D70A-BA1 



Group 20 

QNFAMLEAKMAMAMILKTYAFELSPSYAHAPHPIiliLQPQYGAQLILYKL 



SEQ ID No.! 110 D70A-BD4 



Group 21 

YSMGLKAIQASIJmijLHGFNWSIiPDNMTPBDLNMDEIFGLSTPKKPPIiATVIBPRI.SP^ 
YsioLKBI QASIiAlinijLHGPNWSLPDNMTPEDLNMDE 
HSLGLK^IQASLAMLLHGFl^mSLPDNMTPBDLNMDBIFGLSTPK^ 



SEQ ID NO.J112 D181-AC5 
(96.8) 

SEQ ID No.:114 D144-'AH1 
(96.8) 

SEQ ID No. :116 D34-65 



Group 22 

LCFPCLISSYIIjALNVNIjYHNFLQISPSISY 



SEQ ID No.ill8 D35-BG2 



Group 23 

SGIiAQCWGLAIiATLVQCFEWKRVSEEVVDLTEGKaLTMPKPBPLMARCEARDIFHK^ 



SEQ ID No,: 120 D73A-AH7 



Group 24 

LGIiATVHVNLMLARMIQEPEWSAYPENRKVDLLRNWNIjLW 

IIIIIMII 

LGLATVTm^IjMIjARMIQEFEWSAyPENRK\7DPTEKLEPTVVM 
LGLATVHVNLMIJUiTIQEFEWSAYPENRECVDFTEK^ 



SEQ ID No.:136 D185-BG2 
(77.5) 

SEQ ID NO.:122 D58-AA1 
(98.2) 

SEQ ID No.;134 D185-BC1 
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Group 25 

YAIAMLHLE YFVANLVraFRWEAVEGDDVDIiSEiaBFTVVM I 



SEQ ID No.:124 D73A-AB10 



Group 26 

QQVGLLRTTI FI AS LLS EYKLKPRSHQKQVEIjTDLNPASWLjHS IKGELLVDAI PRKKAAF 



SEQ XD No.:126 D56A-AC12 



Group 27 

ITFAKFVNEIiAIJUiLiMFHFDFSLPKGVKHEDLDVEEAAGITV^ 

I 

ITFAKFVNEIiAIiARIiMFHFDFSLPKaVKHADLDVEBAAGITViaiOT 



SEQ ID No.:128 D177-BF7 
(98.2) 

SEQ ID No.:140 D185-BD2 



Group 28 

QRYAINHLMLFIAIiPTAIilDPKRHKTDGCDDIAYIPTIAPKDDCKVPLSQRCTRFPSFS 



SEQ ID Ho.: 130 D73A-AG3 



Group 29 

MSFGLANLYLPLAQLLYHFDWKIiPTGIKPRDLDLTELSGITIARKGDLYLNATPYQPSRS 

I III I I I I i I I I 

ISFGIiAlSnr^LPLAQLLYHPDWKLPTGINSSDIiDMTESSGWCARKSDLYDTATPYQIiSQB 



SEQ ID No.:132 D70A-AA12 
(80 .0) 

SEQ ID NO.: 86 176-BF2 



Group 30 SEQ ID No.:146 D176-BC3 

QNFAMLEAKTTLAMILQRPSFELSPSYAHAPQSXITCNPSMVLHLFCIKYSLLLVSSVSFYVKHESK^ 



Group 31 

ADMGIiRAVSLALGALIQCFDWQIEEABSIiBESYNSRMTMQNKPIjKWCTPREDLGQLIiSQIj 



SEQ ID No.:148 D176-BB3 



Group 32 

^mYSIiQVEHLSIAH^IIQGPSFATTTNEPLDMKQGVGLTLPKKTDVEVLITPRLPPTLYQY 



SEQ ID No.:6 DI86-AH4 



The percentage identity between most related pairs is noted in (0.0%). Each group had at 
least 70% identity to another group member. Group 19 contained the lowest percentage 
identity at 7 0.0%. 
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FIGURB 150 X COMPARISON OF SBQUBNCB 6ROX7PS 

AIiIGNHENT OF GROUP 1 

D58-BG7 GCy^cy^CTTGCTATCAACTTGGTCACATCTATGTTGGGTCATTTGTTGCATCATTT SEQ ID No 1 

D58-AB1 GCy^CAACTTGCTATCAAGTTGGTCa^Cy^TCTATGTTGGGTCATTTaTTGCaVTCATT^^ SEQ ID No 3 

D58-BB4 GCACAACTTQCTATCAACTTGGTCACATCTATGTTGGGTCATTTGTT-CATCA SEQ ID No 7 

************ 

D5 8 - BG7 TGGGCTCCGGCCCCGGGGGTTAACCCGGAGGATATTGACTTGGAGGAGAGCCCTGGAACA 

I I 
D5 8 - ABl TGGGCTCCGCCCCCGGGGGTTAACCCGGAGAATATTGACTTGGAGGAGAGCCCTGGAACA 

I I 
D5 8 - BB4 TGGGCTCCGGCCCCGGGGGTTAACCCGGAGGATATrGACTTGGAGGAGAGCCCTGGAACA 
********* ******************i** ***************************** 

D5 8 -BG7 GTAACTTACATGAAAAATCCAATACAAGCTATTCCAACTCCAAGATTGCCTGCACACTTG 

I 

DS 8 - ABl GTAACTTACATGAAAAATCCAATACAAGCTATTCCTACTCCAAGATTGCCTGCACACTTG] 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIMIM 

D5 8 - BE4 GTAACTTACATGA - - 

************* 

D 5 8 - BG7 TATGGACGTGTGCCAGTGGATATGTAA 

D58 - ABl TATGGACGTGTGCCAGTGGATATGTAA 

IMIIIIMIIIIIIIIIIMIMIM 

D53-BE4 r 

PERCEliyr IDENTITY OF GROUP 1 

D5 8-BG7 P5 8-BE4 D58-AB1 

D58-BG7 *** 96.2 98.1 SEQ ID No 1 

D58-BE4 *** 94.0 SEQ ID No 7 

D58-AB1 *** SEQ ID No 3 

ALIGNMENT OF GROUP 2 

D56-AH7 GAAGGATTGGCTGTTCGAATGGTTGCCTTGTCATTGGGATGTATTATTCAATGTTTTGAT SEQ ID No 9 

I I I 

D13a-5 GAAGGATTGGCTATTCGAATGGTTGCATTGTCATTOGGATGTATTATTCAATGCTTTGAT SEQ ID No 11 

************ ************* *********************i»f**** ****** 

D5€ -AH7 TGGCAACGTVATCGGCGAAGAATTGGTTGATATGACTGAAGGAACTGGACTTACTTTGCCT 

Mil I 
D13a- 5 TGGCAACGACTTGGGGAAGGATTGGTTGATAAGACTGAAGGAACTGGACTTACTTTGCCT 
********* * ** **** *********** **************************** 

D56 - AH7 AAAGCTCAACCTTTGGTGGCCAAQTGTAGCCCACGACCTAAAATGGCTAATCTTCTCTCT 

I I I 

D13a - 5 AAAGCTCAACCTTTAGTGGCCAAGTGTAGCCCACGACCTATAATGGCTAATCTTCTTTCT 
************** ************************* *************** *** 

D56-AH7 CAGATTTGA 

D13a-5 CAGATTTGA 
********* 

PERCENT IDENTITY OF GROUP 2 

D56~AH7 D13a-5 

D56-AH7 *** 93.7 SEQ ID No 9 

D13a-5 *** SEQ ID No 11 
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FIGDRB 150 s COMPARISON OF dBQTTSKCS GROUPS 

AI.IGNMEKT OF GROTTP 3 

D56-AG10 ATAGGTTTTGCGACTTTAGTGACACATCTGACTTTTGGTCGCTTGCTTCAAGGTTTTGAT SEQ ID No 13 

I 

D35-33 ATAGGCTTTGCGACTTTAGTGACACATCTGACTTTTGGTCGCTTGCTTCAAGGTTTTGAT SEQ ID No 15 

III 

D34-S2 ATAAATTTTGCGACTTTAGTGACACATCTGACTTTTGGTCGCTTGCTTCAAGGTTTTGAT SEQ ID No 17 

D5 6 - AGIO TTTAGTAAGCCATCAAACACGCCAATTGACATGACAGAAGGCGTAGGCGTTACTTTGCCT 

D3 5 - 3 3 TTTAGTAAGCCATCAAACACGCCAATTGACATGACAGAAGGCGTAGGCGTTACTTTGCCT 

t 1 
D3 4 - 6 2 TTTAGTACGCCATCAAACACGCCAATAGACATOACAGAAGGCGTAGGCGTTACTTTGCCT 

D5 6 - AGIO AAGGTTAATaU^QTTGAAGTTCTAATTACCCCTCGTTTACCTTCTAAGCTTTATTTATTTTGA 

I I 

D3 5 - 3 3 AAGGTTAATCAAGTTGAAGTTCTAATTACCCCTCXSTTTACerTCTAAGCTTTATTTAT 

II I IN 

D3 4 - 62 AAGGTAAATCAAGTGGAAGTTCTAATTAGCCCTCX3TTTACCTTCTAAGCTTTATGTATTCTGA 
***** ******** ************* ************************* *** 

PERCENT IDENTITY OP GROUP 3 

D5g-AQ10 035-33 D34-62 

D56-Aai0 *** 98.9 9S.1 SEQ ID No 13 

035-33 *** 94,4 SEQ ID No 15 

D34-62 *** SEQ ID No 17 

\ 

AXiiaNMEMT OF QROtTP 4 



D5 6 - AA7 ATTATACTTGCATTGCCAATTCTTGGCATCACTTTGGGACGTTTGGTTCAGAACTTTGAG 

I 

D5 6 - AEl ATTATACTTGCATTGCCy^TTCTTGGCATTACTTTGGGACGTTTGGTTCAGAACTTTGAG 

II III 

Dl 8 5 -BD 3 ATTATCCTTGCACTGCCAATTCTTaaCATTACCTTGGGACGCTTGGTGCAGAACTTTGAG 

***flr* ****** **************** ** ******** ***** ************ 

D5 6 - AA7 CTGTTGCCTCCTCCAGGCCAGTCGAAGCTCGACAGCACAGAGAAAGGTGGACAGTTCAGT 

D5 6 -AEl CTGTTGCCTCCTCCAGGCCAGTCGAAGCTCGACACCACAGAGAAAGGTGGACAGTTCAGT 

I I I I I II 

Dl a 5 -BD3 TTGTTGCCTCCTCCAGGACAGTCAAAGCTTQACACAACAGAGAAAGGCGGGCAATTCAGT 
**************** ***** ***** ***** *********** ** ** ****** 

D56-AA7 CTCCACATTITGAAGCATTCCACCATTGTGTTGAAACCAAGGTCTTTCTGA 

I 

056-7^1 CTCCATATTTTGAAGO^TTCCACCATTGTGTTGAAACCAAGGTCTTGCTGA 

II I I II I 

D185-BD3 CTGCACATTTXGAAGCATTCCACCATTGTGATGAAACCAAGATCTTTTTAA 
** ** ************************ ********** **** * * 

PERCENT IDENTITY OF GROUP 4 

D56AA7 D56-AS1 D185-BP3 

D56AA7 *** 98.2 87.7 SEQ ID No 19 

D56-AE1 *** 87.1 SEQ ID No 21 

D18S-BD3 *** SEQ ID No 143 
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FIGURE 150 s COMPARISON OF SBQVBNCB GROUPS 



ALIGMfMBUT OF GROUP 5 



D35-BB7 

D177-BA7 

D144-AE2 

D56A-AB6 
D3 5-BB7 
D177-BA7 
D144-AE2 

D56A-AB6 
D35-BB7 
D177-BA7 
D144-AE2 



ATTGCACTTGGGGTTGCATCCATGGAACTTGCTTTGTCAAATCTTCTTTATGCATTTGAT SEQ ID No 27 

ATTGGACTTGOGaTTGCTVTaU^TGGAACTTGCATTGTCAAATCTTCTTTATGCATTT^^ SEQ ID No 23 

ATTGCACTTGGGGTTGCATCCy^TGGAACTTGCTTTGTCAAATCTTCTTrATCCATTTGAT SEQ ID No 25 

ATTGCACTTGGGGTTGCATCCATGGAACTTGCTTTGTCy^TCn'TCTTTATGCATTT^^ SEQ ID No 29 

***A****4r****««***«« ************************* 

TGGGAGTTGCCTTATGQAGTOAAAAAAGAAGACATCGACACAAACGTTAGGCCTGGAATT 

III I II 

TGGGAGTTACCTTTTGGAATGAAAAAAGAAGACATTGACACAAACGCCAaGCCTQGAATT 

III I II 

TGGGAGTTACCTTACGGAGTGAAAAAAGAAAACATTGACAGAAATGTCAGGCCTGGAATT 
II III 

TGGGAGTTGCCTTATGGAGTGAAAAAAGAAGACATCGACACAAACGTTAGGCCTaOAATT 
■kitlcifk*** •k'k'k'k *** ***yr**^**** itit-h* ******** * 

GCCATGCACAAGAAAAACGAACTTTGCCTTGTCCGAAAAAA-TTATTTATAA 

II II I I II 
ACCATGCATAAGAAAAACGAACTTTATCTTATCCCTAAAAA- TTATCTATAG 



II 

ACCATGCATAAGAAAAACGAACTTTGCCTTATCCGTAGAAA-TTATCTATAG 

GCCATGGACAAGAAAAACGAACTTTGCCTTGTCCa^AAAAAArTATTTATAAA 

******* **************** *** **** * ititif Aftttf, **** 



D56A-AB6 
D35-BB7 
D177-BA7 
D144-AE2 
PERCEirr IDENTITY OF GROUP 5 

D56A-AB6 D35-BB7 



llllllllllllllllllllllll 
ATTGGaACGTGGATCTCATGCTAG 



D56A-AB6 
D35-BB7 
D144-AE2 
D177-BA7 



90.6 
*** 



D144-AE2 



97.1 
87.7 
*** 



D177-BA7 



91.8 
93.0 
88.9 
**« 



SEQ ID NO 27 
SEQ ID No 23 
SEQ ID No 29 
SEQ ID No 25 



ALIGNMENT OF GROUP 6 



D5 6 - AGl 1 ATTTCGTTTGGTTTAGCTAATGCTTATTTGCCATTGGCTCAATTACTTTATCACTTTGAT 

I I I 

D179-AA1 ATTTCGTTTGGCTTAGCTAATGCTTATTTGCCATTGGCTCAATTACTATATCACTTCGAT 
*********** *********************************** ******** 

D56 - AGl 1 TGGGAACTCCCCACTGGAATCAAACCAAGCGACTTGGACTTGACTGAGTTGGTTGGAGTA 

I II I 

D179-AA1 TGGAAACTCCCTGCTGGAATCGAACCAAGCGACTTGGACTTGACTGAGTTGGTTGGAGTA 

*** ******* ********************************.*.***** 

D56 - AGl 1 ACTGCCGCTAGAAAAAGTGACCTTTACTTGGTTGCGACrCCTTATCAACCTCCTCAAAACTGA 

D179-AA1 ACTGCCGCTAGAAAAAGTGACCTTTACTTGGTTGCGACTCCTTATa^CCTCCTCAAAAGT^ 
**************** 
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FXGURB 150 s COMPARISON OF SEQUENCB QROITPS 

PBRCKHT IDENTITY OP GROUP 6 

SEQ ID No 31 SEQ ID No 33 

D56-AG11 D179-AA1 

D56-AG11 *** 95.6 SEO ID No 31 

D179-AA1 *** SEQ ID No 33 

AIiIGMMBNT Og GROUP 7 

D56-AC7 ATGCTATTTGGTTTAGCTAATGTTGGACAACCTTTAGCTCAGTTACTTTATCACTTCGAT SEQ ID No 35 

D144-AD1 ATOCTATTTGGTTTAGCTAATGTTGGACAACCTTTAGCTCAGTTACTTTATCACTTCGAT SEQ ID No 37 

******** **************************i»f ************************* 

D5 6 - AC7 TGGAAACTCCCTAATGGACAAAGTCATGAGAATTTCGACATGACTGAGTCACCTGGAATT 

I II I 

D144 -ADl TGGAAACTCCCTAATGGAC^UU^CTCACCAAAATTTCGACATGACTGAGTCACCTaGAATT 
********************** **^^f * ****************************** 

D 5 6 - AC7 TCTGCTACAAGAAAGGATGATCTTGTTTTGATTGCCACTCCTTATGATTCTTATTAA 

I II I 

D 1 4 4 - ADl TCTGCTACAAGAAAGGATGATCTTATTTTGATTGCCACTGCTGCTCATTCTTGA 
************************ ***************** * *^^«^^ 

PBRCEITT IDENTITY OF GROUP 7 

D144-AP1 D56-AC7 

D144-AD1 *** 94.3 SEQ ID No 37 

D56-AC7P *** SEQ ID NO 3 5 

AIiIGNHENT OF GROUP 9 

D181-AB5 ATGTCGTTTGGTTTAGTTAACACTGGGCATCCTTTAGCTCAGTTGCTCTATTTCTTTGAC SEQ ID No 41 

I I I 

D73-AC9 ATGTCGTTTGGTTTAGTTAACyVCAGGGCATCCTTTAGCCCAGTTGCTCTATTGCTTTGAC SEQ ID No 43 

*********************** ************** ************* itit^iHt** 

Dl 8 1 - AB 5 TGGAAATTCCCTCATAAGGTTAATGGAQCTGATTTTCACACTACTGAAACAAGTAGAGTT 

III II I 

D7 3 - AC9 TGGAAACTCCCTGACAAGGTTAATGCAAATGATTTTCGCACrACTGAAACAAGTAGAGTT 
****** ***** * ************ ******** ********************** 

Dl 8 1 - AB 5 TTTGCAGCAAGO^AAGATGACCTCTACTTGATTCCTVAaU^TCACATGGAGCT^ 

D7 3 - AC9 TTTGCAGCAAGCAAAGATGACCTCTACTTGATTCCCS^CZ^AATCA^ 

*********************************** ********** **«******. **^^ 

PERCENT IDENTITY OF GROUP 9 

D181-AB5 D73-AC9 

D181-ABS *** 92,8 SEQ ID No 41 

D73-AC9 *** SEQ ID No 43 
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FXOURB 150 1 COMPARISON OF 3BQUBNCB GROUPS 

AiiiGNMEyrr of group ii 

D58-AB9 ATGACITATGCyVTTGCAAGTGGAAa^CCTAACAATGGCAC^ SEQ ID No 47 

SSQ ID No 4 9 



D56-AG9 



ATGACTTATGCATTGCAAGTGGAACUVCCTAACAATGGCACy^TTTAATCCAa 

D3 5-BG11 ATGACTrATGCyVTTGCAAGTGGAACACTTAAaUVTaGa^(^^^ SEQ ID No 53 

I II I 

ATGACTTATGCa.TTACAAGTGGAAa^CCTAACAATAaa^CATTTGATCC:AGGGTO^ SEQ ID No 63 

I II I 

ATGACTTATGCATTGCAAGTGGAACACTTAAa^TGaCACATTTGATCC^ SEQ ID No 57 

ATGACTTATGCATTACAAGTGGAACACCTAAC7UVTAGCACATTTGATC SEQ ID No 61 

ATGACTTATGCATTGCAAGTGGAACACCTAACAATGGCACATTri^TCC^ SEQ ID No 51 

< I I 

ATGACTTATGCATTGCAAaTGGAACACTTAACAATGGCACATTTCSATCCAAGGTTTCAAT SEQ ID No 55 
I II I 

ATGACTTATGCATTACAAGTGGAACACCTAACAATAGCACATTTGATCCAGGGTTTCAAT SEO ID No 59 
************** ************ ******* ******** ***** ********* ^ 



D34-25 
D35-BA3 
D34-52 
D56-AG6 
D35-42 
D34-57 



D58-AB9 

D56-AG9 

D35-BG11 

D34-25 

D35-BA3 

D34-52 

D5€-AG€ 

D35-42 

D34-57 

D58-AB9 
D56-AG9 
D35-BG11 
■ D34-25 
D35-BA3 
D34-52 
D56-AG6 



TACAOAACrCCAACTGATGAGCCCTTGGATATGAAAGAAGGTGCAGGCATAACTATACGT 
III I 
ATGAC( 

ATGACC 

I I II 

TACAAAACTCCAAATGACGAGCCCCTGGATATGAAGGAAGGTGCAGGATTAACTATACGT 
I I M 



III I 
TACATJ^CTCCAAATQACGAGGCCTTGGATATGAAGGAAGGTGCAGGCATA^^ 

TACAGAACTCCMATGACGAGCCCTTGGATATGAAGGAAGGTGCAGGCy^TAACTATACGT 

I I i 



' I II 

TACy^GAACTCCaAATGACGAGCCCTTGGATATGAAGGAAGGTGCAGGCATAACTATACGT 

I I I 

TACAAAACTCCAAATGACGAGCCCTTGGATATGAAGGAAGGTGCAGGATTAACTATACGT 

I I I 

TACAAAACTCCa^TGACGAGGCCTIXKSATATGAAGGAAGGTGCAGGCATAACAATACGT 

' I I 

TACAGAACTCCAAATGACGAGCCCTTGGATATGAAGGAAGGTGCAGGCATAACTATACGT 

I III 
TACAAAACTCC?VAATGACGAGCCCTTGGATATGAAGGAAGGTGCAGGATTAACCATACGT 
**** ******^* ******★***,*********** **** ****** 

AAGGTAAATCCTGTGAAAGTGATAATTACGCCTCGCTTGGCACCTGAGCTTTATTAA 

II I I 

AAGGTAAATCCTGTGGAACTGATAATAGCGCCTCGCCTGGCACCTGAGCTTTATTAA 



AAGGTAAATCCTGTGGAACTGATAATAGCGCCTCGCCTGGCACCTGAGCTTTATTAA 
I II II I 

7AGAAGTGACAATTACGG 

i II I 

:ggaactgataatagcgc 

II II I 

'AGAAGTGACAATTACGG 

I III III j 

AAGGTAAATCCAGTOGAATTGATAATAACGCCTCGCTTGCGACCTOAGCTTTACTAA 

I i j I 1 1 1 I 



I II Ml 

AAAGTAAATCCTGTAGAAGTGACAATTACGGCTCGCCTGGCACCTGAGCTTTATTAA 

I M II I 

AAGGTAAATCCTGCGGAACTGATAATAGCGCCTCGCCTGGCACCTGAGCTTTATTAA 

I Ml II I 

AAAGTAAATCCTGTAGAAGTGACAATTACGGCTCGCCTGGCACGTGAGCTTTATTAA 

I III III I 



D35-42 AAGGTAAATCCTGTGGAACTGATAATAGCGCCCC- - 

I I I III II III 

AAAGTAAATCCTGTAGAAQTGACAACTACGGCTCGCCTGGCACCTGAGCTTTATTAA 

** ******** * ** *** ** ** * * **************** *** 



D34-57 
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FIGURBI 150 s COMPARISON OF SBQITBNCB GROUPS 

PERCENT IDENTITY OF GROUP 11 



D58-AB9 

D56-AG9 

D56-AG6 

D35-BG11 

D35-42 

D35-BA3 

D34-57 

D34-52 

D34-25 



D58-AB9 



D56-AG6 



D35-42 



D34-57 



D34-.25 



D56-AG9 



D35-BG11 



D35-BA3 



D34-52 



93.8 



93 .2 
96.6 
*** 



94 .3 
97.2 
93.8 

-k-kit 



90.8 
94.2 
90.2 
97.1 
Keltic 



93 .2 
96.6 
92.6 
99.4 
96 . 5 
*** 



90 
91 
90 
90 
87 
90 
*** 



92 
92 
90 
92 
88 
91 
98 



91 
92 
90 
91 
87 
90 
98 
99.4 
*** 



SBQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 



ID No 47 
ID No 49 
ID NO 51 
ID No 53 
ID No 55 
ID No 57 
ID No 59 
ID No 61 
ID NO 63 



ALIGNMEKT OF GROUP 14 

D177-BD7 ATTAATTTTTCAATACCACTTGTTGAGCTTGCACTTaCTAATCTATTGTTTCATTATAAT SEQ ID No 83 

D177-BD5 

D177-BD7 
D177-BD5 

D177-BD7 
D177-BD5 



ATTAATTTTTCAATACCACTTGTTGAGCTTGCACTTGCTAATCTATTGTTTCATTATAAT SEQ ID No 69 
TGGTCACTTCCTGAGGGGATGCTACCTAAGGATGTTGATATGGAAGAAGCTTTGGGGATT 

I I 

TGGTCACTTCCTGAAGGGATGCTAGCTAAGGATGTTGATATGGAAGAAGCTTTGGGGATT 

* ************* ie**ieifk-k*ie itititit***lt-klt1eie*iiitie*ltit*'k***-k'k-k**-k'tfk-k-k* 

ACCATGCACAAGAAATCTCCCCTTTGCTTAGTAGCTTCTCATTATAACTTGTTGTGA 

I III 

ACCATGCACAAGAAATCTCCCCTTTGCTTAGTAGCTTCTCATTATA- CTTGTTGA- - 

Ke*it ft it ieit it-kit it* iri^*** it* * It it it* it Ktie It it* ieititieieit ieic* it ieie it ** ******* 



PERCENT IDENTITY OF GROUP 14 
D177-BD7 



D177-BD7 
D177-BD5 



D177-BD5 



96.0 
*** 



SEQ ID NO 83 
SBQ ID No 69 



ALIGNMENT OF GROUP 15 

D56A-AG10 ATGCAACTTGGGCTTTATGCATTGGAAATGGCTGTGGCCCATCTTCTTCATTGTTTTACT SEQ ID No 71 

D58-AD12 ATGCAACTTGGGCTTTATGCATTGGAAATGGCTGTGGCCCATCTTCTTCATTGTTTTACT SEQ ID No 75 

II II 
D58-BC5 ATGCAACTTGGGCTTTATGCATTAGAAATGGCAGTGGCCCATCTTCTTCTTTGCTTTACT SEQ ID No 73 

*********************** **************** *** itie**** 

TGGGAATTGCCAGATGGTATGAAACCAAGTGAGCTTAAAATGGATGATATTTTTGGACTC 
TGGGAATTGCCAGATGGTATGAAACCAAGTGAGCTTAAAATGGATGATATTTTTGGACTC 

TGGGAATTGCCAGATGGTATGAAACCAAGTGAGCTTAAAATGGATGATATTTTTGGACTC 

it****ifkit***i(*ie*****iei(ir****it**ir******it**ir**ie***ir**ieicir***ific*ic* 

ACTGCTCCAAAAGCTAATCGACTCGTGGCTGTGCCTACTCCACGTTTGTTGTGTCCGCTT 

I 

ACTGCTCCAAGAGCTAATCGACTCGTGGCTGTGCCTACTCCACGTTTGTTGTGTCCCCTT 

I I I 

ACTGCTCCAAGAGCTAATCGACTCGTGGCTGTGCCTAGTCCACGTTTGTTGTGCCCACTT 
********** ************************** *************** ** *** 



D56A-AG10 

D58-AD12 

D58-BC5 

D56A-AG10 

D58-AD12 

D58-BC5 
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FIGURE 150 s COMPARISON 07 SEQUENCE GROUPS 

D56A-AG10 TATTAi- 
D58-AD12 TATTAA 
D58-BCS TATTAA 

PERCENT IDENTITY OF GROUP 15 

D5 6 A" AGIO D58-AD12 D58-BC5 

D56A-AG10 *** 99.5 95.7 SEQ ID No 71 

D5a-Abl2 *** 96.2 SEO ID No 75 

D58-BC5 *** SEQ ID No 73 

AIiIGNMBNT OF GROUP 16 

D56-AD€ ATGCTTTC3GAGTGGGAGTATAGTGCGCGTCAGCTACCTAACTTGTATTTATAGATTCCAA SEQ ID Mo 87 

D56-AC11 ATGCTTTGGAGTGCGAGTATAGTGCOCGTCAaCTACCTAACTTaTATTTATAGATTCCAA SEQ ID No 77 

D35-39 ATGCTTTGGAGTGCGAGTATAGTGCGCGTCAGCTACCTAACTTGTATTTATAGATTCCAA SEQ ID No 79 

I 

D58-BH4 ATGCTTTGGAGTGCGAGTATAGTGCGCGTCAGCTACCTAACCTGTATTTATAGATTCCAA SEQ ID No 81 

****************** 

D5 6 - AD 6 aTATATGCTGGGTCTGTGTCCAGAGTAGCATGA 

I 

D5 6 - ACl 1 GTATATGCTGGGTCTGTGTTCAGAGTAGCATGAD3 5-39 

GTATATGCTGGQTCTQTGTTCAGAQTAGCATGA 

D58 -BH4 GTATATGCTGGGTCTGTGTTCAGAGTAGCATGA 

*-k*<k*ie-fc-k***'kit'k-k*iit'k'k ************* 

PERCENT IDENTITY OF GROUP 16 

D56-AC11 D56-AD6 D58-BH4 D35--39 

D56-AC11 *** 98.7 98.7 98.7 SEQ ID No 77 

D56-AD6 98.7 98.7 SEQ ID No 87 

D58-BH4 *** 98.7 SEQ ID No 81 

D35-39 *** SEQ ID No 79 

AlilGNMENT OF GROUP 17 

D73A-ADS CTGAATTTTGCAATGTTAGAGGCAAAAATGGCACTTGCATTGATTCTACAACACTATGCT SEQ ID No 8 9 

D70A-BA11 CTGAATTTTGCAATGTTAGAGGCAAAAATGGCACTTGCATTGATTCTACAACACTATGCT SEQ ID No 91 

************************************************************ 

D7 3 A- ADS TTrGAGCTCTCTCCATCTTATGCACATGCTCCTCATACAATTATCACTCTGCAACCTCAA 

I 

D70A-BA11 TTTGAGCTCTCTCCATCTTATGCACACGCTCCTCATACAATTATCACTCTGCAACCTCAA 
************************** ********************************* 

D7 3 A- AD6 CATGGTGGTCCTTTGATTTTGCGCAAGCTGTAG 

D7 OA- BAl 1 CATGGTGCTCCTTTGATTTTGCGCAAGCTGTAG 
********************************* 
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7ZCSURB 150 1 COMPARXSON OF SBQUEirCB GROUPS 

PERCENT IDBOTXT^ OF' GROUP 17 

D73A-AD 70A-BA11 
D73A-AD6 *** 99.3 SEQ ID No 89 

D70A~BA11 *** SEQ ID No 91 

AltlGMMBNT OF GROUP 18 

D70A-AB5 CAAAACTTCGCGATTTTGGAAGCAAAAATGGCTATAGCTATGATrCTACAACGCTTCT^ SEQ ID No 95 



D70A-AA8 CAAAACTTCGCGATTTTGGAAGCAAAAATGGCTATAGCTATGATrCTACAACGCTTCTCC 



SEQ ID No 97 



D70A-AB5 



TTCGAGCTCTCCCCATCTTATACACACTCTCCATACACTGTGGTCACTTTGAAACCCAAA 



D70A-AA8 TTCCSAGCTCTCTCCATCTTATACACACTCTCCATACACrGTGGTCACTTTGAAACCC^^ 
***************** ****************************** 

D7 OA- AB5 TATGGTGCTCCCCTAATAATGCACAGGCTGTAG 
D7 0 A- AA8 TATGGTGCTCCCCTAATAATGCACAGGCTGTAG 
*********************** It 

PERCENT IDENTITY OF GROUP 18 

D70A-AB5 D7 0A-AA8 

D70A-AB5 *** 99.6 SEQ ID No 95 

D70A-AA3 *** SEQ ID No 97 

AlilGNMENT OF GROUP 19 

D70A-AB8 CyU^TTTTGCCATGTTAGAAGCAAAGATGGCTCTGTCTATGATCCTGCAACGCTT^ SEQ ID No 99 

N I I 1 1 I II I I 

D70A-BH2 ATAAACTTTGCAATGACAGAAGCGAAGATGGCTATGGCTATGATTCTGCAACGCTTCTCC SEQ ID No 101 

D70A-AA4 ATAAACTTTGCAATGGCAGAAGCGAAGArGGCTATGGCTATGATTCTGCAACGCTTCTCC SEQ ID NO 103 

*** ***** *** ****** **** ** ** ******* ** *********** 

D70A-AB8 TTTGAACTGTCTCCGTCTTATGCACATGCCCCTCAGTCCATATTAACCGT-CAGCCA 

D70A-BH2 TTTGAGCTATCTCCy^TCTTAC^iCACATGCTCCACAGTCTGTAATAACTA^^ 

D70A-AA4 TTTGAGCTATCTCa^TCTTACACACATGCTCCACa^GTCTGTAATAACTATGCAACCCayi 
***** ** ***** ***** ******* ** ***** ** **** ^ 

D7 0 A- AB 8 TATGGTGCTCCACTTATTTTCCACAAGCTArAA 

D70A-BH2 TATGGTGCTCCTCTTATATTGCa^CAiATTGTAA 
D7 0 A- AA4 TATGGTGCTCCTCTTATATTGCACAAATTGTAA 
*********** ***** ** ** ** * ** 

PERCENT IDENTITY OF GROUP 19 

D70A-ABa D70A-AA4 D7 0A-BH2 

D70A-AB8 *** 77.8 77.8 SEQ ID No 99 

D70A-AA4 *** 99.3 ggQ id No 101 

D70A--BH2 *** SEQ ID No 103 

ALIGNMENT OF GROUP 20 

D70A-BA1 CAAAACTTTGCAATGATGGAAGCAAAAATGGCAGTAGCTATGATACTAGAAAAATTTTCC SEQ ID NO 105 



101/141 



wo 2005/038033 



PCT/US2004/034065 



rXQVKB 150 1 COMPARISON OF SEQUB^^ GROUPS 



D70A-BA9 



CAAAACTTTGCy^ATGATGGAAGCAAAAATGGCa^GTAGCTATGATACTACATA^ SEQ ID No 107 

it it -kit* it* -kit 



D7 0 A- BAl TTTGAACTATCCCCTTCTTATACACATGCTCCATTTGC^ATTGTGACrATTCATCCTCAG 

D7 0 A- BA9 TTTGAACTATCCCCTTCTTATACACyVTGCTCCATTTGCAATTGTGACTATTCATCCTCAG 
**#w*«****it***^ **************************** ****** 

D7 0 A- BAl TArGGTGCTCCTCTGCTTATGCGCAGACTTTAA 

D70A-BA9 TATGGTGCTCCTCTGCTTATGCGCAGACTTTAA 
★ ***^i^******************** 



PERCENT IDENTITY OF GROUP 20 



D70A-BA1 



D70A-BA1 
D70A-BA9 



D70A-BA9 



99.4 
*** 



SEQ ID NO 105 
SEQ ID No 107 



ALIGNMEiNT OP GROTTP 22 



D144-AH1 TATAGCTTGGGGCTCAAGGAGATTCAAGCTAGCTTAGCTAATCTTCTACATGGATTTAAC SEQ ID No 113 

I I 

D34-65 CA.TAGCTTGGGGCTCAAGGTGATTCAAGCTAGCTTAGGTAATCTTCTACATGGATTTAAC SEQ ID No 115 

I I I 

D181-AC5 TATAGCATGGGGCTCAAGGCGATTCAAaCTAGCTTAGCTAATCTTCTACATGGATTTAAC SEQ ID No 111 

*****.************ ******* *************************4-***««*« 

D144 -AHl TGGTCATTGCCTGATAATATGACTCCTGAGGACCTCAACATGGATGAGATTTTTGGGCTC 

D3 4 - 6 5 TGGTCATTGCCTGATAATATGACTCCTGAGGACCTCAACATGGATGAGATTTTTGGGCTC 

Dl 8 1 - ACS TGGTCATTGCCTGATAATATGACTCCTGAGGACCTCAACATGGATGAGATTTTTGGGCTC 
**************************************«-****«*********.*#*«*** 

Dl 4 4 - AH 1 TCTACACCTAAAAAATTTCCACTTGCTACTGTGATTGAGCCAAGACTTTCACCAAAACTT 

D3 4 - 6 5 TCTACACCTAAAAAATTTCCACTTGCTACTGTGATTGAGGCAAGACTTTCACCAAAACTT 

Dl 8 1 - AC5 TCTACACCTAAAAAATTTCCACTTGCTACTGTGATTGAGCCAAGACTTTCACCAAAACTT 
************************************************************ 

D14 4 -AHl TACTCTGTTTGA 

D3 4 - 6 5 TACTCTGTTTGA 

D181-AC5 TACTCTGTTTGA 
************ 

PERCENT IDENTITY OP GROUP 22 

D34-65 D181-AC5 D144-AH1 

D34-65 *** 98.4 99.0 SEQ ID NO 115 

D181-AC5 *** 99.0 SEQ ID No 111 

D144-AH1 *** SEQ ID No 113 

ALIQNMEUT OF GROUP 25 

D58-AA1 TTGGGCTTGGCAACGGTGCATGTGAATTTGATGTTGGCCCGAATGATTCAAGAATTTGAA SEQ ID No 121 

D185-BC1 TTGGGCTTGGCAACGGTGCATGTGAATTTGATGTTGGCCCGAACGATTCAAGAATTTGAA SEQ ID No 133 
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FI6TIRB 150 1 COMPARISON OF SBQDBNCB GROTTPS 

I 

D185-BG2 TTGGQCTTGGO^CGGTGCATGTGAATTTGATGTTGGCCCGAATGATTCyUl SEQ ID No 135 

**itiiitit^tit**it*ii**itit'kirit1t'kieik*'k*itlt*'k-k***it'k4it*'kiiitie *4r4(V(r4r-* ********** 

DSa - AAl TGGTCCGCTTACCCGGAAAATAGGAAAGTGGATTTTACTGAGAAATTGGAATTTACTGTG 

D185-BC1 TGGTCCGCTTACCCGOAAAATAGGAAAGTGGATTTTACTGAGAAATTGGAATTTACTGTG 

I 

D185 -BG2 TOGTCCGCTTACCCGGAAAATAaGAAAGTOGATTT- ACTGAGAAATTGQAATTTACTGTG 

DBS -AAl GTGATGAAAAATCCTTTAAGAGCTAAGGTCAAGCCAAGAATGCAAGTGGTGTAA 

I 

D185 -BCl GTGATGAAAAACCCTTTAAGAGCTAAGGTCAAGCCAAGAATGCAAGTGGTGTAA 

IIIIIIMIIIIIIIIIIIIIIIIIIIIIillllllllllllllllllll 

D18S-BG2 GTGA 

PERCENT IDENTITY OF GROUP 2 5 

D58~AA1 D185-BG2 D185--BC1 

D58-AA1 *** 95.9 98.9 SEQ ID No 121 

D185-BG2 *** 95.1 SEQ ID No 135 

D185-BC1 *** SEQ ID No 133 



AIiIGNMBliT OP GROUP 28 

D177-BF7 ATCACATTTGCTAAGTTTaTGAATOAGCTAaCATTGGCAAGATTAATGTTCCATTTTGAT SEQ ID No 127 

D185-BD2 ATCACyVTTTGCTAAGTTTGTGAATGAGCTAGCATTGGCTVAGATTAATGTTCCATTTTGAT SEQ ID No 139 

D185-BE1 ATCACATTTGCTAAGTTTGTGAATGAGCTAGCATTGGCAAGATTAATGTTCCATTTTGAT SEQ ID No 137 

****** ************************************** 

D177-BF7 TTCTCGCTACCAAAAGGAGTTAAGCATGAGGATTTGGACGTGGAGGAAGCTGCTGGAATT 

I 

DISS -BD2 TTCTCGGTACCAAAAGGAGTTAAGCATGCGGATTTGGACGTGGAGGAAGCTGCTGGAATT 

I 

D185-BE1 TTCTCGCTACCAAAAGGAGTTAAGCATGAGGATTTGGACGTGGAGGAAGCTGCTGGAATT 
****************************** ******************************* 

D177 -BF? ACTGTTAGAAGGAAGTTCCCCCTTTTAGCCGTCGCCACTCCATGGTCGTGA 

D1B5~BD2 ACTGTTAGAAGGAAGTTCCCCCTTTTAGCCGTCGCCACTCCATGCrCGTGA 

I 

Dl 8 5 - BE 1 ACTGTTAGGAGGAAGTTCCCCCTTTTAGCCGTCGCCACTCCATGCTCGTGA 

******** *************«r***** *********************** 

PERCENT IDENTITY OF GROUP 2 8 

D177-BF7 D185-BD2 D185-BE1 

D177-BF7 *** 99.4 99.4 SEQ ID No 127 

D185-BD2 *** 98.8 SEQ ID No 139 

D185-BE1 *** SEQ ID No 137 



ALIGNMBMT OF GROUP 30 

D70A-AA12 ATGTCATTTGGTTTAGCTAATCTTTACTTACCATTGGCTCAATTACTCTATCACTTTGAC SEQ ID No 131 

I I I I I I I II I I I 

D176-BP2 ATATCATTTGGTTTGGCTAATGTTTATTTGCCACTAGCTCAATTGTTATATCATTTTGAT SEQ ID No 85 
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FIOXmB 150 1 COUPARISON OF SBQUBKC6 GR0T7PS 

** *********** ****** **** ** itit* * ******** * ***** ***** 

D7 OA- AAl 2 TGGAAACTCC(::AACCGGAATCAAGCa^GAGACrrTGGACTTGACCGAATTATCG{K5A^ 

I I > II I I I I II I I 

D176 -BF2 TGGAAACTCCCTACTGGAATO^TTC/^GTGACTTGGACATGACTGAGTCGTCAGGAGTA 
*********** ** ******** **** ^j^^^^^^jt* **** ** * ** **« If* 

D7 OA- AAl 2 ACTATTGCTAGAAAGGGTGACCTTTACTTAAATGCTACTCCTTATCAACCTTCTCGAGAGTAA 

II I II I II I III 

Dl 7 6 - BF2 ACTTGTGCTAGAAAGAGTGATTTATACTTGACTGCTACTCCATATCAACTTTCTCAAGAGTGA 

*** ********** **** * ***** * itit-tslHeifkifie ******* ***** ^ * * * ^ 

PERCENT IDENTITY OF GROUP 30 

D176-BF2 D7QA-AA12 
D176-BF2 *** 77.0 SEQ ID Mo 85 

D70A-AA.12 *** SEQ ID No 131 
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FIG. 153 

NAME D425-AB10 
ORGANISM NICOTIANA TABACUM 
SEQ ID 356 

1 ATGCTTTCTC CCATAGAAGC CATTGTAGGA 
61 CTATGGACAA AAAAATCTCA AAAACCTTCA 
121 TGGCCGGTAA TCGGCCATCT TTTCCACTTC 
181 CGAAAACTCG GAGACTTAGC TGACAAATAC 
241 CCCCTTGTCT TAGTTGTAAG CAGTTACGAA 
3 01 GCCATTTTTT CCAATCGTCC AGCTTTTCTT 
3 61 ATGCTATTTT TGGCCAATTA CGGACCTTAC 
421 GAAGTTCTCT CCGCTAGTCG TCTCGAAAAA 
481 GCGAGCATTA AGAATTTATA TACTCGAATT 
541 GATTGGTTAG AAGAATTGAA TTTTGGTCTG 
601 GAATCCGGTA AAGGAGATGA ACAATTGGAG 
661 ATTTTATCAA TGGAGTTTGT GTTATGGGAT 
721 GATTTTCAAG GGCATGTTAA GGCTATGAAA 
781 CAGAATTGGT TAGAGGAACA TATTAATAAA 
841 AATGAACAAG ATTTCATTGA TGTGGTGCTT 
901 GGTTACTCTC GTGATACTGT CATTAAAGCA 
961 GACACAGTTG CTCTTCACAT AAATTGGGGA 
1021 TTGACGAAAG CACAAGAAGA GATAGACACA 
1081 AGTGATATTA AGGATTTGGT ATACCTCCAA 
1141 CCACCAGGAC CTTTGTTAGT ACCACACGAA 
12 01 CACATTCCTA AAGGGACAAG ATTATTCGCA 

12 61 CTCTGGTCTG ATCCTGATAC TTTCGATCCA 

13 21 CGTGGTCAGT ACTATAAGTA TATCCCGTTT 
13 81 ACTTATGCAT TGCAAGTGGA ACACTTAACA 
1441 AGAACTCCAA ATGACGAGCC CTTGGATATG 
1501 GTAAATCCTG TGGAACTGAT AATAGCGCCT 
1561 AGATCATCTT GOT 



CTAGTAACCT TCACATTTCT CTTCTTCTTC 
AAACCCTTAC CACCGAAAAT CCCCGGAGGA 
AATGACGACG GCGACGACCG TCCATTAGCT 
GGCCCCGTTT TCACTTTTCG GCTAGGCCTT 
GCTGTAAAAG ACTGTTTCTC TACAAATGAC 
TACGGCGATT ACCTTGGCTA CAATAATGCC 
TGGCGAAAAA ATCGAAAATT AGTTATTCAG 
TTCAAACACG TGAGATTTGC AAGAATTGAA 
GATGGAAATT CGAGTACGAT AAATTTAACT 
ATCGTGAAGA TGATCGCTGA AAAAAATTAT 
AGATTTAAGA AAGCGTTTAA GGATTTTATG 
GCATTTCCAA TTCCATTATT TAAATGGGTG 
AGGACTTTTA AAGATATAGA TTCTGTTTTT 
AGAGAAAAAA TGGAGGTTAA TGCAGAAGGG 
TCAAAAATGA GTAATGAATA TCTTGGTGAA 
ACGGTGTTTA GTTTGGTCTT GGATGCAGCA 
ATGGCATTAT TGATAAACAA TCAAAAGGCC 
AAAGTTGGTA AGGACAGATG GGTAGAAGAG 
GCTATTGTTA AAGAAGTGTT ACGATTATAT 
AATGTAGAAG ATTGTGTTGT TAGTGGATAT 
AACGTCATGA AACTGCAACG TGATCCTAAA 
GAGAGATTCA TTGCTACTGA TATTGACTTT 
GGTTCTGGAA GACGATCTTG TCCAGGGATG 
ATGGCACATT TGATCCAAGG TTTCAATTAC 
AAGGAAGGTG CAGGCATAAC TATACGTAAG 
CGCCTGGCAC CTGAGCTTTA TTAAAACCTA 



SEQ XD 357 

1 MLSPIEAIVG LVTFTFLFFF LWTKKSQKPS 
61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK 
181 DWLEELNFGL IVKMIAEKNY ESGKGDEQLE 

2 41 DFQGHVKAMK RTFKDIDSVF QNWLEEHINK 

3 01 GYSRDTVIKA TVFSLVLDAA DTVALHINWG 
3 61 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE 
421 LWSDPDTFDP ERFIATDIDF RGQYYKYIPF 
481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP 



KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 
AVKDCFSTND AIFSNRPAFL YGDYLGYNNA 
FKHVRFARIQ ASIKNLYTRI DGNSSTINLT 
RFKKAFKDFM ILSMEFVLWD AFPIPLFKWV 
REKMEVNAEG NEQDFIDWL SKMSNEYLGE 
MAIiLINNQKA LTKAQEEIDT KVGKDRWVEE 
NVEDCWSGY HIPKGTRLFA NVMKLQRDPK 
GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
RLAPELY 
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FIG. 154 

NAME D425-AB11 

ORGANISM NICOTIANA TABACUM 
SEQ XD 358 

1 ATGCTTTCTC CCATAGAAGC CATTGTAGGA 
61 CTATGGACAA AAAAATCTCA AAAACCTTCA 
121 TGGCCGGTAA TCGGCCATCT TTTCCACTTC 
181 CGAAAACTCG GAGACTTAGC TGACAAATAC 
241 CCCCTTGTCT TAGTTGTAAG CAGTTACGAA 
3 01 GCCATTTTTT CCAATCGTCC AGCTTTTCTT 
3 61 ATGCTATTTT TGGCCAATTA CGGACCTTAC 
421 GAAGTTCTCT CCGCTAGTCG TCTCGAAAAA 
481 GCGAGCATTA AGAATTTATA TACTCGAATT 
541 GATTGGTTAG AAGAATTGAA TTTTGGTCTG 
601 GAATCCGGTA AAGGAGATGA ACAAGTGGAG 
661 ATTTTATCAA TGGAGTTTGT GTTATGGGAT 
721 GATTTTCAAG GGCATGTTAA GGCTATGAAA 
7 81 CAGAATTGGT TAGAGGAACA TATTAATAAA 
841 AATGAACAAG ATTTCATTGA TGTGGTGCTT 
9 01 GGTTACTCTC GTGATACTGT CATTAAAGCA 
961 GACACAGTTG CTCTTCACAT AAATTGGGGA 
1021 TTGACGAAAG CACAAGAAGA GATAGACACA 
1081 AGTGATATTA AGGATTTGGT ATACCTCCAA 
1141 CCACCAGGAC CTTTGTTAGT ACCACACGAA 
12 01 CACATTCCTA AAGGGACAAG ATTATTCGCA 

12 61 CTCTGGTCTG ATCCTGATAC TTTCGATCCA 
1321 CGTGGTCAGT ACTATAAGTA TATCCCGTTT 

13 81 ACTTATGCAT TGCAAGTGGA ACACTTAACA 
1441 AGAACTCCAA ATGACGAGCC CTTGGATATG 
1501 GTAAATCCTG TGGAACTGAT AATAGCGCCT 
1561 AGATCATCTT GCT 



CTAGTAACCT TCACATTTCT CTTCTTCTTC 
AAACCCTTAC CACCGAAAAT CCCCGGAGGA 
AATGACGACG GCGACGACCG TCCATTAGCT 
GGCCCCGTTT TCACTTTTCG GCTAGGCCTT 
GCTGTAAAAG ACTGTTTCTC TACAAATGAC 
TACGGCAATT ACCTTGGCTA CAATAATGCC 
TGGCGAAAAA ATCGAAAATT AGTTATTCAG 
TTCAAACACG TGAGATTTGC AAGAATTGAA 
GATGGAAATT CGAGTACGAT AAATTTAACT 
ATCGTGAAGA TGATCGCTGG AAAAAATTAT 
AGATTTAAGA AAGCGTTTAA GGATCTTATG 
GCATTTCCAA TTCCATTATT TAAATGGGTG 
AGGACTTTTA AAGATATAGA TTCTGTTTTT 
AGAGAAAAAA TGGAGGTTAA TGCAGAAGGG 
TCAAAAATGA GTAATGAATA TCTTGGTGAA 
ACGGTGTTTA GTTTGGTCTT GGATGCAGCA 
ATGGCATTAT TGATAAACAA TCAAAAGGCC 
AAAGTTGGTA AGGACAGATG GGTAGAAGAG 
GCTATTGTTA AAGAAGTGTT ACGATTATAT 
AATGTAGAAG ATTGTGTTGT TAGGGGATAT 
AACGTCATGA AACTGCAACG TGATCCTAAA 
GAGAGATTCA TTGCTACTGA TATTGACTTT 
GGTTCTGGAA GACGATCTTG TCCAGGGATG 
ATGGCACATT TGATCCAAGG TTTCAATTAC 
AAGGAAGGTG CAGGCATAAC TATACGTAAG 
CGCCTGGCAC CTGAGCTTTA TTAAAACCTA 



SEQ ID 359 

1 MLSPIEAIVG LVTFTFLFFF LWTKKSQKPS 

61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE 

121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK 

181 DWLEELNFGL IVKMIAGKNY ESGKGDEQVE 

2 41 DFQGHVKAMK RTFKDIDSVF QNWLEEHINK 

3 01 GYSRDTVIKA TVFSLVLDAA DTVALHINWG 
3 61 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE 
421 LWSDPDTFDP ERFIATDIDF RGQYYKYIPF 
481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP 



KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 
AVKDCFSTND AIFSNRPAFL YGNYLGYNNA 
FKHVRFARIQ ASIKNLYTRI DGNSSTINLT 
RFKKAFKDLM ILSMEFVLWD AFPIPLFKWV 
REKMEVNAEG NEQDFIDWL SKMSNEYLGE 
MALLINNQKA LTKAQEEIDT KVGKDRWVEE 
NVEDCWRGY HIPKGTRLFA NVMKLQRDPK 
GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
RLAPELY 
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FXO. 155 

NAME D425-AC9 

ORGANISM NICOTIANA TABACUM 

SEQ ID 360 

1 ATGCTTTCTC CCATAGAAGC CATTGTAGGA 
61 CTATGGACAA AAAAATCTCA AAAACCTTCA 
121 TGGCCGGTAA TCGGCCATCT TTTCCACTTC 
181 CGAAAACTCG GAGACTTAGC TGACAAATAC 
241 CCCCTTGTCT TAGTTGTAAG CAGTTACGAA 
3 01 GCCATTTTTT CCAATCGTCC AGCTTTTCTT 
3 61 ATGCTATTTT TGGCCAATTA CGGACCTTAC 
421 GAAGTTCTCT CCGCTAGTCG TCTCGAAAAA 
481 GCGAGCATTA AGAATTTATA TACTCGAATT 
541 GATTGGTTAG AAGAATTGAA TTTTGGTCTG 
601 GAATCCGGTA AAGGAGATGA ACAAGTGGAG 
661 ATTTTATCAA TGGAGTTTGT GTTATGGGAT 
721 GATTTTCAAG GGCATGTTAA GGCTATGAAA 
781 CAGAATTGGT TAGAGGAACA TATTAATAAA 
841 AATGAACAAG ATTTCATTGA TGTGGTGCTT 
901 GGTTACTCTC GTGATACTGT CATTAAAGCA 
961 GACACAGTTG CTCTTCACAT AAATTGGGGA 
1021 TTGACGAAAG CACAAGAAGA GATAGACACA 
1081 AGTGATATTA AGGATTTGGT ATACCTCCAA 
1141 CCACCAGGAC CTTTGTTAGT ACCACACGAA 
12 01 CACATTCCTA AAGGGACAAG ATTATTCGCA 
12 61 CTCTGGTCTG ATCCTGATAC TTTCGATCCA 
1321 CGTGGTCAGT ACTATAAGTA TATCCCGTTT 
1381 ACTTATGCAT TGCAAGTGGA ACACTTAACA 
1441 AGAACTCCAA ATGACGAGCC CTTGGATATG 
1501 GTAAATCCTG TGGAACTGAT AATAGCGCCT 
1561 AGATCATCTT GOT 



CTAGTAACCT TCACATTTCT CTTCTTCTTC 
AAACCCTTAC CACCGAAAAT CCCCGGAGGA 
AATGACGACG GCGACGACCG TCCATTAGCT 
GGCCCCGTTT TCACTTTTCG GCTAGGCCTT 
GCTGTAAAAG ACTGTTTCTC TACAAATGAC 
TACGGCGATT ACCTTGGCTA CAATAATGCC 
TGGCGAAAAA ATCGAAAATT AGTTATTCAG 
TTCAAACACG TGAGATTTGC AAGAATTGAA 
GATGGAAATT CGAGTACGAT AAATTTAACT 
ATCGTGAAGA TGATCGCTGG AAAAAATTAT 
AGATTTAAGA AAGCGTTTAA GGATTTTATG 
GCATTTCCAA TTCCATTATT TAAATGGGTG 
AGGACTTTTA AAGATATAGA TTCTGTTTTT 
AGAGAAAAAA TGGAGGTTAA TGCAGAAGGG 
TCAAAAATGA GTAATGAATA TCTTGGTGAA 
ACGGTGTTTA GTTTGGTCTT GGATGCAGCA 
ATGGCATTAT TGATAAACAA TCAAAAGGCC 
AAAGTTGGTA AGGACAGATG GGTAGAAGAG 
GCTATTGTTA AAGAAGTGTT ACGATTATAT 
AATGTAGAAG ATTGTGTTGT TAGTGGATAT 
AACGTCGTGA AACTGCAACG TGATCCTAAA 
GAGAGATTCA TTGCTACTGA TATTGACTTT 
GGTTCTGGAA GACGATCTTG TCCAGGGATG 
ATGGCACATT TGATCCAAGG TTTCAATTAC 
AAGGAAGGTG CAGGCATAAC TATACGTAAG 
CGCCTGGCAC CTGAGCTTTA TTAAAACCTA 



SEQ XD 361 

1 MLSPIEAIVG LVTFTFLFFF LWTKKSQKPS 
61 RKLGDLADKY , GPVFTFRLGL PLVLWSSYE 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK 
181 DWLEELNFGLi IVKMIAGKNY ESGKGDEQVE 
241 DFQGHVKAMK RTFKDIDSVF QNWIiEEHINK 
301 GYSRDTVIKA TVFSLVLDAA DTVALHINWG 
3 61 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE 
421 LWSDPDTFDP ERFIATDIDF RGQYYKYIPF 
481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP 



KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 
AVKDCFSTND AIFSNRPAFL YGDYLGYNNA 
FKHVRFARIQ ASIKNIiYTRI DGNSSTINLT 
RFKKAFKDFM ILSMEFVIiWD AFPIPLFKWV 
REKMEVNAEG NEQDFIDWL SKMSNEYLGE 
MALLINNQKA LTKAQEEIDT KVGKDRWVEE 
NVEDCWSGY HIPKGTRLFA NWKLQRDPK 
GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
RLAPELY 
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PIG. 156 

NAME D425-AC10 
ORGANISM NICOTIANA TABACUM 
SEQ ID 362 



1 CATAATGCTT TCTCCCATAG AAGCCATTGT 
61 CTTCCTATGG ACAAAAAAAT CTCAAAAACC 
121 AGGATGGCCG GTAATCGGCC ATCTTTTCCA 
181 AGCTCGAAAA CTCGGAGACT TAGCTGACAA 
241 CCTTCCCCTT GTCTTAGTTG TAAGCAGTTA 
3 01 TGACGCCATT TTTTCCAATC GTCCAGCTTT 
3 61 TGCCATGCTA TTTTTGGCCA ATTACGGACC 
421 TCAGGAAGTT CTCTCCGCTA GTCGTCTCGA 
481 TCAAGCGAGC ATTAAGAATT TATATACTCG 
541 AACTGATTGG TTAGAAGAAT TGAATTTTGG 
601 TTATGAATCC GGTAAAGGAG ATGAACAAGT 
661 TATGATTTTA TCAATGGAGT TTGTGTTATG 
721 GGTGGATTTT CAAGGGCATG TTAAGGCTAT 
781 TTTTCAGAAT TGGTTAGAGG AACATATTAA 
841 AGGGAATGAA CAAGATTTCA TTGATGTGGT 
901 TGAAGGTTAC TCTCGTGATA CTGTCATTAA 
961 AGCAGACACA GTTGCTCTTC ACATAAATTG 
1021 GGCCTTGACG AAAGCACAAG AAGAGATAGA 
10 81 AGAGAGTGAT ATTAAGGATT TGGTATACCT 
1141 ATATCCACCA GGACCTTTGT TAGTACCACA 

12 01 ATATCACATT CCTAAAGGGA CAAGATTATT 
1261 TAAACTCTGG TCTGATCCTG ATACTTTCGA 
1321 CTTTCGTGGT CAGTACTATA AGTATATCCC 

13 81 GATGACTTAT GCATTGCAAG TGGAACACTT 
1441 TTACAGAACT CCAAATGACG AGCCCTTGGA 
1501 TAAGGTAAAT CCTGTGGAAC TGATAATAGC 
1561 CCTAAGATCA TCTTGCT 



AGGACTAGTA ACCTTCACAT TTCTCTTCTT 
TTCAAAACCC TTACCACCGA AAATCCCCGG 
CTTCAATGAC GACGGCGACG ACCGTCCATT 
ATACGGCCCC GTTTTCACTT TTCGGCTAGG 
CGAAGCTGTA AAAGACTGTT TCTCTACAAA 
TCTTTACGGC GATTACCTTG GCTACAATAA 
TTACTGGCGA AAAAATCGAA AATTAGTTAT 
AAAATTCAAA CACGTGAGAT TTGCAAGAAT 
AATTGATGGA AATTCGAGTA CGATAAATTT 
TCTGATCGTG AAGATGATCG CTGGAAAAAA 
GGAGAGATTT AAGAAAGCGT TTAAGGATTT 
GGATTCATTT CCAATTCCAT TATTTAAATG 
GAAAAGGACT TTTAAAGATA TAGATTCTGT 
TAAAAGAGAA AAAATGGAGG TTAATGCAGA 
GCTTTCAAAA ATGAGTAATG AATATCTTGG 
AGCAACGGTG TTTAGTTTGG TCTTGGATGC 
GGGAATGGCA TTATTGATGA ACAATCAAAA 
CACAAAAGTT GGTAAGGACA GATGGGTAGA 
CCAAGCTATT GTTAAAGAAG TGTTACGATT 
CGAAAATGTA GAAGATTGTG TTGTTAGTGG 
CGCAAACGTC ATGAAACTGC AACGTGATCC 
TCCAGAGAGA TTCATTGCTA CTGATATTGG 
GTTTGGTTCT GGAAGACGAT CTTGTCCAGG 
AACAATGGCA CATTTGATCC AAGGTTTCAA 
TATGAAGGAA GGTGCAGGCA TAACTATACG 
GCCTCGCCTG GCACCTGAGC TTTATTAAAA 



SEQ ID 363 



1 MLSPIEAIVG LVTFTFLFFF LWTKKSQKPS 

61 RKIiGDLADKY GPVFTFRLGL PLVLWSSYE 

121 MLFLANYGPY WRKNFOOiVIQ EVIiSASRLEK 

181 DWLEELNFGL IVKMIAGKNY ESGKGDEQVE 

2 41 DFQGHVKAMK RTFKDIDSVF QNWLEEHIMK 

3 01 GYSRDTVIKA TVFSLVLDAA DTVALHINWG 
3 61 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE 
421 LWSDPDTFDP ERFIATDIGF RGQYYKYIPF 
481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP 



KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 
AVKDCFSTND AIFSNRPAFL YGDYLGYNNA 
FKHVRFARIQ ASIKNLYTRI DGNSSTINLT 
RFKKAFKDFM ILSMEFVI>WD SFPIPLFKWV 
REKMEVNAEG NEQDFIDWL SKMSNEYLGE 
MALLMNNQKA LTKAQEEIDT KVGKDRWVEE 
NVEDCWSGY HIPKGTRLFA NVMKLQRDPK 
GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
RLAPELY 
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FIG. 157 

NAME D425-AC11 
ORGANISM NICOTIANA TABACUM 
SBQ ID 364 

1 ATGCTTTCTC CCATAGAAGC CATTGTAGGA CTAGTAACCT TCACATTTCT CTTCTTCTTC 
61 CTATGGACAA AAAAATCTCA AAAACCTTCA AAACCCTTAC CACCGAAAAT CCCCGGAGGA 
121 TGGCCGGTAA TCGGCCATCT TTTCCACTTC AATGACGACG GCGACGACCG TCCATTAGCT 
181 CGAAAACTCG GAGACTTAGC TGACAAATAC GGCCCCGTTC TCACTTTTCG GCTAGGCCTT 
241 CCCCTTGTCT TAGTTGTAAG CAGTTACGAA GCTGTAAAAG ACTGTTTCTC TACAAATGAC. 
3 01 GCCATTTTTT CCAATCGTCC AGCTTTTCTT TACGGCGATT ACCTTGGCTA CAATAATGCC 
3 61 ATGCTATTTT TGGCCAATTA CGGACCTTAC TGGCGAAAAA ATCGAAAATT AGTTATTCAG 
421 GAAGTTCTCT CCGCTAGTCG TCTCGAAAAA TTCAAACACG TGAGATTTGC AAGAATTCAA 
481 GCGAGCATTA AGAATTTATA TACTCGAATT GATGGAAATT CGAGTACGAT AAATTTAACT 
541 GATTGGTTAG AAGAATTGAA TTTTGGTCTG ATCGTGAAGA TGATCGCTGG AAAAAATTAT 
601 GAATCCGGTA AAGGAGATGA ACAAGTGGAG AGATTTAAGA AAGCGTTTAA GGATTTTATG 
661 ATTTTATCAA TGGAGTTTGT GTTATGGGAT GCATTTCCAA TTCCATTATT TAAATGGGTG 
7 21 GACTTTCAAG GGCATGTTAA GGCTATGAAA AGGACTTTTA AAGATATAGA TTCTGTTTTT 
7 81 CAGAATTGGT TAGAGGAACA TATTAATAAA AGAGAAAAAA TGGAGGTTAA TGCAGAAGGG 
841 AATGAACAAG ATTTCATTGA TGTGGTGCTT TCAAAAATGA GTAATGAATA TCTTGGTGAA 
901 GGTTACTCTC GTGATACTGT CATTAAAGCA ACGGTGTTTA GTTTGGTCTT GGATGCAGCA 
961 GACACAGTTG CTCTTCACAT AAATTGGGGA ATGGCATTAT TGATAAACAA TCAAAAGGCC 
1021 TTGACGAAAG CACAAGAAGA GATAGACACA AAAGTTGGTA AGGACAGATG GGTAGAAGAG 
1081 AGTGATATTA AGGATTTGGT ATACCTCCAA GCTATTGTTA AAGAAGTGTT ACGATTATAT 
1141 CCACCAGGAC CTTTGTTAGT ACCACACGAA AATGTAGAAG ATTGTGTTGT TAGTGGATAT 
1201 CACATTCCTA AAGGGACAAG ATTATTCGCA AACGTCATGA AACTGCAACG TGATCCTAAA 
1261 CTCTGGTCTG ATCCTGATAC TTTCGATCCA GAGAGATTCA TTGCTACTGA TATTGACTTT 
1321 CGTGGTCAGT ACTATAAGTA TATCCCGTTT GGTTCTGGAA GACGATCTTG TCCAGGGATG 
13 81 ACTTATGCAT TGCAAGTGGA ACACTTAACA ATGGCACATT TGATCCAAGG TTTCAATTAC 
1441 AGAACTCCAA ATGACGAGCC CTTGGATATG AAGGAAGGTG CAGGCATAAC TATACGTAAG 
15 01 GTAAATCCTG TGGAACTGAT AATAGCGCCT CGCCTGGCAC CTGAGCTTTA TTAAAACCTA 
1561 AGATCATCTT GOT 



S&Q XD 365 

1 MLSPIEAIVG LVTFTFLFFF LWTKKSQKPS 
61 RiCLGDLADKY GPVFTFRLGL PLVLWSSYE 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK 
181 DWLEELNFGL IVKMIAGKNY ESGKGDEQVE 
241 DFQGHVKAMK RAFKDIDSVF QNWLEEHINK 
3 01 GYSRDTVIKA TVPSLVLDAA DTVALHINWG 
3 61 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE 
421 LWSDPDTFDP ERFIATDIDF RGQYYKYIPF 
481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP 



KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 
AVKDCFSTND AIFSNRPAFL YGDYLGYWNA 
FKHVRFARIQ ASIKNLYTRI DGMSSTINLT 
RFKKAFKDFM ILSMEFVLWD AFPIPLFKWV 
REKMEVNAEG NEQDFIDWL SKMSNEYLGE 
MAIiLINNQKA LTKAQEEIDT KVGKDRWVEE 
NVEDCWSGY HIPKGTFtLFV NVMKLQRDPK 
GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
RLAPELY 
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FIG. 158 

NAME D425-AG11 

ORGANISM NICOTIANA TABACUM 
SEQ ID 366 

1 ATGCTTTCTC CCATAGAAGC CATTGTAGGA 
61 CTATGGACAA AAAAATCTCA AAAACCTTCA 
121 TGGCCGGTAA TCGGCCATCT TTTCCACTTC 
181 CGAAAACTCG GAGACTTAGC TGACAAATAC 
2tl CCCCTTGTCT TAGTTGTAAG CAGTTACGAA 
301 GCCATTTTTT CCAATCGTCC AGCTTTTCTT 
361 ATGCTATTTT TGGCCAATTA CGGACCTTAC 
421 GAAGTTCTCT CCGCTAGTCG TCTCGAAAAA 
481 GCGAGCATTA AGAATTTATA TACTCGAATT 
541 GATTGGTTAG AAGAATTGAA TTTTGGCCTG 
601 GAATCCGGTA AAGGAGATGA ACAAGTGGAG 
661 ATTTTATCAA TGGAGTTTGT GTTATGGGAT 
721 GATTTTCAAG GGC ATGTTAA GGCTATGAAA 
7 81 CAGAATTGGT TAGAGGAACA TATTAATAAA 
841 AATGAACAAG ATTTCATTGA TGTGGTGCTT 
901 GGTTACTCTC GTGATACTGT CATTAAAGCA 
961 GACACAGTTG CTCTTCACAT AAATTGGGGA 
1021 TTGACGAAAG CACAAGAAGA GATAGACACA 
1081 AGTGATATTA AGGATTTGGT ATACCTCCAA 
1141 CCACCAGGAC CTTTGTTAGT ACCACACGAA 
12 01 CACATTCCTA AAGGGACAAG ATTATTCGTA 
1261 CTCTGGTCTG ATCCTGATAC TTTCGATCCA 
1321 CGTGGTCAGT ACTATAAGTA TATCCCGTTT 
1381 ACTTATGCAT TGCAAGTGGA ACACTTAACA 
1441 AGAACTCCAA ATGACGAGCC CTTGGATATG 
1501 GTAAATCCTG TGGAACTGAT AATAGCGCCT 
1561 AGATCATCTT GOT 



CTAGTAACCT TCACATTTCT CTTCTTCTTC 
AAACCCTTAC CACCGAAAAT CCCCGGAGGA 
AATGACGACG GCGATGACCG TCCATTAGCT 
GGCCCCGTTT TCACTTTTCG GCTAGGCCTT 
GCTGTAAAAG ACTGTTTCTC TACAAATGAC 
TACGGCGATT ACCTTGGCTA CAATAATGCC 
TGGCGAAAAA ATCGAAAATT AGTTATTCAG 
TTCAAACACG TGAGATTTGC AAGAATTGAA 
GATGGAAATT CGAGTACGAT AAATTTAACT 
ATCGTGAAGA TGATCGCTGG AAAAAATTAT 
AGATTTAAGA AAGCGTTTAA GGATTTTATG 
GCATTTCCAA TTCCATTATT TAAATGGGTG 
AGGGCTTTTA AAGATATAGA TTCTGTTTTT 
AGAGAAAAAA TGGAGGTTAA TGCAGAAGGG 
TCAAAAATGA GTAATGAATA TCTTGGTGAA 
ACGGTGTTTA GTTTGGTCTT GGATGCAGCA 
ATGGCATTAT TGATAAACAA TCAAAAGGCC 
AAAGTTGGTA AGGACAGATG GGTAGAAGAG 
GCTATTGTTA AAGAAGTGTT ACGATTATAT 
AATGTAGAAG ATTGTGTTGT TAGTGGATAT 
AACGTCATGA AACTGCAACG TGATCCTAAA 
GAGAGATTCA TTGCTACTGA TATTGACTTT 
GGTTCTGGAA GACGATCTTG TCCAGGGATG 
ATGGCACATT TGATCCAAGG TTTCAATTAC 
AAGGAAGGTG CAGGCATAAC TATACGTAAG 
CGCCTGGCAC CTGAGCTTTA TTAAAACCTA 



SEQ ID 367 

1 MLSPIEAIVG LVTFTFLFFF LWTKKSQKPS 

61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE 

121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK 

181 DWLEELNFGIi IVKMIAGKNY ESGKGDEQVE 

241 DFQGHVKAMK RAFKDIDSVF QNWLEEHINK 

301 GYSRDTVIKA TVFSLVLDAA DTVALHINWG 

3 61 SDIKDLVYLQ AIVKEVLKLY PPGPLLVPHE 

421 LWSDPDTFDP ERFIATDIDF RGQYYKYIPF 

481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP 



KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 
AVKDCFSTND AIFSNRPAFL YGDYLGYNNA 
FKHVRFARIQ ASIKNLYTRI DGNSSTINLT 
RFKKAFKDFM ILSMEFVLWD AFPIPLFKWV 
REKMEVNAEG NEQDFIDWL SKMSNEYLGE 
MALLINNQKA LTKAQEEIDT KVGKDRWVEE 
NVEDCWSGY HIPKGTRLFV NVMKLQRDPK 
GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
RliAPELY 
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FIG. 159 

NAME D425-AH7 

ORGANISM NICOTIANA TABACUM 
SBQ ID 368 

1 ATGCTTTCTC CCATAGAAGC CATTGTAGGA 
61 CTATGGACAA AAAAATCTCA AAAACCTTCA 
121 TGGCCGGTAA TCGGCCATCT TTTCCACTTC 
181 CGAAAACTCG GAGACTTAGC TGACAAATAC 
241 CCCCTTGTCT TAGTTGTAAG CAGTTACGAA 
301 GCCATTTTTT CCAATCGTCC AGCTTTTCTT 
361 ATGCTATTTT TGGCCAATTA CGGACCTTAC 
421 GAAGTTCTCT CCGCTAGTCG TCTCGAAAAA 
4 81 GCGAGCATTA AGAATTTATA TACTCGAATT 
541 GATTGGTTAG AAGAATTGAA TTTTGGTCTG 
601 GAATCCGGTA AAGGAGATGA ACAAGTGGAG 
661 ATTTTATCAA TGGAGTTTGT GTTATGGGAT 
721 GATTTTCAAG GGCATGTTAA GGCTATGAAA 
7 81 CAGAATTGGT TAGAGGAACA TATTAATAAA 
841 AATGAACAAG ATTTCATTGA TGTGGTGCTT 
901 GGTTACTCTC GTGATACTGT CATTAAAGCA 
961 GACACAGTTG CTCTTCACAT AAATTGGGGA 
1021 TTGACGAAAG CACAAGAAGA GATAGACACA 
1081 AGTGATATTA AGGATTTGGT ATACCTCCAA 
1141 CCACCAGGAC CTTTGTTAGT ACCACACGAA 

12 01 CACATTCCTA AAGGGACAAG ATTATTCGCA 
1261 CTCTGGTCTG ATCCTGATAC TTTCGATCCA 

13 21 CGTGGTCAGT ACTATAAGTA TATCCCGTTT 
13 81 ACTTATGCAT TGCAAGTGGA ACACTTAACA 
1441 AGAACTCCAA ATGACGAGCC CTTGGATATG 
1501 GTAAATCCTG TGGAACTGAT AATAGCGCCT 
1561 AGATCATCTT GCT 



CTAGTAACCT TCACATTTCT CTTCTTCTTC 
AAACCCTTAC CACCGAAAAT CCCCGGAGGA 
AATGACGACG GCGACGACCG TCCATTAGCT 
GGCCCCGTTT TCACTTTTCG GCTAGGCCTT 
GCTGTAAAAG ACTGTTTCTC TACAAATGAC 
TACGGCGATT ACCTTGGCTA CAATAATGCC 
TGGCGAAAAA ATCGAAAATT AGTTATTCAG 
TTCAAACACG TGAGATTTGC AAGAATTGAA 
GATGGAAATT CGAGTACGAT AAATTTAACT 
GTCGTGAAGA TGATCGCTGG AAAAAATTAT 
AGATTTAAGA AAGCGTTTGA GGATTTTATG 
GCATTTCCAA TTCCATTATT TAAATGGGTG 
AGGACTTTTA AAGATATAGA TTCTGTTTTT 
AGAGAAAAAA TGGAGGTTAA TGCAGAAGGG 
TCAAAAATGA GTAATGAATA TCTTGGTGAA 
ACGGTGTTTA GTTTGGTCTT GGATGCAGCA 
ATGGCATTAT TGATAAACAA TCAAAAGGCC 
AAAGTTGGTA AGGACAGATG GGTAGAAGAG 
GCTATTGTTA AAGAAGTGTT ACGATTATAT 
AATGTAGAAG ATTGTGTTGT TAGTGGATAT 
AACGTCATGA AACTGCAACG TGATCCTAAA 
GAGAGATTCA TTGCTACTGA TATTGACTTT 
GGTTCTGGAA GACGATCTTG TCCAGGGATG 
ATGGCACATT TGATCCAAGG TTTCAATTAC 
AAGGAAGGTG CAGGCATAAC TATACGTAAG 
CGCCTGGCAC CTGAGCTTTA TTAAAACCTA 



SBQ ID 369 

1 MLSPIEAIVG LVTFTFLFFF LWTKKSQKPS 
61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK 
181 DWLEELNFGL WKMIAGKNY ESGKGDEQVE 
241 DFQGHVKAMK RTFKDIDSVF QNWLEEHINK 
301 GYSRDTVIKA TVFSLVLDAA DTV.ZUJHINWG 
361 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE 
421 LWSDPDTFDP ERFIATDIDF RGQYYKYIPF 
481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP 



KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 
AVKDCFSTND AIFSNRPAFL YGDYLGYNNA 
FKHVRFARIQ ASIKNLYTRI DGNSSTINLT 
RFKKAFEDFM ILSMEFVLWD AFPIPLFKWV 
REKMEVNAEG NEQDFIDWL SKMSNEYLGE 
MALLINNQKA LTKAQEEIDT KVGKDRWVEE 
NVEDCWSGY HIPKGTRLFA NVMKLQRDPK 
GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
RLAPELY 
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FIG. 160 

NAME D425-AH11 

ORGANISM NICOTIANA TABACUM 
SBQ ID 370 

1 ATGCTTTCTC CCATAGAAGC CATTGTAGGA 
61 CTATGGACAA AAAAATCTCA AAAACCTTCA 
121 TGGCCGGTAA TCGGCCATCT TTTCCACTTC 
181 CAAAAACTCG GAGACTTAGC TGACAAATAC 
241 CCCCTTGTCT TAGTTGTAAG CAGTTACGAA 
201 GCCATTTTTT CCAATCGTCC AGCTTTTCTT 
3 61 ATGCTATTTT TGGCC AATTA CGGACCTTAC 
421 GAAGTTCTCT CCGCTAGTCG TCTCGAAAAA 
481 GCGAGCATTA AGAATTTATA TACTCGAATT 
541 GATTGGTTAG AAGAATTGAA TTTTGGTCTG 
601 GAATCCGGTA AAGGAGATGA ACAAGTGGAG 
661 ATTTTATCAA TGGAGTTTGT GTTATGGGAT 
7 21 GATTTTCAAG GGCATGTTAA GGCTATGAAA 
7 81 CAGAATTGGT TAGAGGAACA TATTAATAAA 
841 AATGAACAAG ATTTCATTGA TGTGGTGCTT 
901 GGTTACTCTC GTGATACTGT CATTAAAGCA 
961 GACACAGTTG CTCTTCACAT AAATTGGGGA 
1021 TTGACGAAAG CACAAGAAGA GATAGACACA 
1081 AGTGATATTA AGGATTTGGT ATACCTCCAA 
1141 CCACCAGGAC CTTTGTTAGT ACCACACGAA 
1201 CACATTCCTA AAGGGACAAG ATTATTCGCA 
1261 CTCTGGTCTG ATCCTGATAC TTTCGATCCA 
13 21 CGTGGTCAGT ACTATAAGTA TATCCCGTTT 
13 81 ACTTATGCAT TGCAAGTGGA ACACTTAACA 
1441 AGAACTCCAA ATGACGAGCC CTTGGATATG 
1501 GTAAATCCTG TGGAACTGAT AATAGCGCCT 
1561 AGATCATCTT GCT 



CTAGTAACCT TCACATTTCT CTTCTTCTTC 
AAACCCTTAC CACCGAAAAT CCCCGGAGGA 
AATGACGACG GCGACGACCG TCCATTAGCT 
GGCCCCGTTT TCACTTTTCG GCTAGGCCTT 
GCTGTAAAAG ACTGTTTCTC TACAGATGAC 
TACGGCGATT ACCTTGGCTA CAATAATGCC 
TGGCGAAAAA ATCGAAAATT AGTTATTCAG 
TTCAAACACG TGAGATTTGC AAGAATTGAA 
GATGGAAATT CGAGTACGAT AAATTTAACT 
ATCGTGAAGA TGATCGCTGG AAAAAATTAT 
AGATTTAAGA AAGCGTTTAA GGATTTTATG 
GCATTTCCAA TTCCATTATT TAAATGGGTG 
AGGACTTTTA AAGATATAGA TTCTGTTTTT 
AGAGAAAAAA TGGAGGTTAA TGCAGAAGGG 
TCAAAAATGA GTAATGAATA TCTTGGTGAA 
ACGGTGTTTG GTTTGGTCTT GGATGCAGCA 
ATGGCATTAT TGATAAACAA TCAAAAGGCC 
AAAGTTGGTA AGGACAGATG GGTAGAAGAG 
GCTATTGTTA AAGAAGTGTT ACGATTATAT 
AATGTAGAAG ATTGTGTTGT TAGTGGATAT 
AACGTCATGA AACTGCAACG TGATCCTAAA 
GAGAGATTCA TTGCTACTGA TATTGACTTT 
GGTTCTGGAA GACGATCTTG TCCAGGGATG 
ATAGCACATT TGATCCAAGG TTTCAATTAC 
AAGGAAGGTG CAGGCATAAC TATACGTAAG 
CGCCTGGCAC CTGAGCTTTA TTAAAACCTA 



SBQ ID 371 

1 MLSPIEAIVG LVTFTFLFFF LWTKKSQKPS 
61 QKLGDLADKY GPVFTFRLGL PLVLWSSYE 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK 
181 DWLEELNFGL IVKMIAGKNY ESGKGDEQVE 
241 DFQGHVKAMK RTFKDIDSVF QNWLEEHINK 
3 01 GYSRDTVIKA TVFGLVLDAA DTVALHINWG 
; 361 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE 
421 LWSDPDTFDP ERFIATDIDF RGQYYKYIPF 
481 RTPNDEPLDM KEGAGITIRK VNPVELIIAP 



KPLPPKIPGG WPVIGHLFHF NBDGDDRPLA 

AVKDCFSTDD AIFSNRPAFL YGDYLGYNNA 

FKHVRFARIQ ASIKNLYTRI DGNSSTINLT 

RFKKAFKDFM ILSMEFVLWD AFPIPLFKWV 

REKMEVNAEG NEQDFIDWL SKMSNEYLGE 

MALLINNQKA LTKAQEEIDT KVGKDRWVEE 

NVEDCWSGY HIPKGTRLFA NVMKLQRDPK 

GSGRRSCPGM TYALQVEHLT lAHLIQGFNY 
RLAPELY 
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FIG. 161 

NAME D427-AA5 
ORGANISM NICOTIANA TABACUM 
SSQ ID 372 

1 ATGCTTTCTC CCATAGAAGC CATTGTAGGA CTAGTAACCT TCACATTTCT CTTCTTCTTC 
61 CTATGGACAA AAAAATCTCA AAAACCTTCA AAACCCTTAC CACCGAAAAT CCCCGGAGGA 
121 TGGCCGGTAA TCGGCCATCT TTTCCACTTC AATGACGACG GCGACGACCG TCCATTAGCT 
181 CGAAAACTCG GAGACTTAGC TGACAAATAC GGCCCCGTTT TCACTTTTCG GCTAGGCCTT 
241 CCCCTTGTCT TAGTTGTAAG CAGTTACGAA GCTGTAAAAG ACTGTTTCTC TACAAATGAC 
301 GCCATTTTTT CCAATCGTCC AGCTTTTCTT TACGGCGATT ACCTTGGCTA CAATAATGCC 
3 61 ATGCTATTTT TGGCCAATTA CGGACCTTAC TGGCGAAAAA ATCGAAAATT AGTTATTCAG 
421 GAAGTTCTCT CCGCTAGTCG TCTCGAAAAA TTCAAACACG TGAGATTTGC AAGAATTCAA 
481 GCGAGCATTA AGAATTTATA TACTCGAATT GATGGAAATT CGAGTACGAT AAATTTAACT 
541 GATTGGTTAG AAGAATTGAA TTTTGGTCTG ATCGTGAAGA TGATCGCTGG AAAAAATTAT 
601 GAATCCGGTA AAGGAGATGA ACAAGTGGAG AGATTTAAGA AAGCGTTTAA GGATTTTATG 
661 ATTTTATCAA TGGAGTTTGT GTTATGGGAT GCATTTCCAA TTCCATTATT TAAATGGGTG 
721 GATTTTCAAG GGCATGTTAA GGCTATGAAA AGGACTTTTA AAGATATAGA TTCTGTTTTT 
781 CAGAATTGGT TAGAGGAACA TATTAATAAA AGAGAAAAAA TGGAGGTTAA TGCAGAAGGG 
841 AATGAACAAG ATTTCATTGA TGTGGTGCTT TCAAAAATGA GTAATGAATA TCTTGGTGAA 
901 GGTTACTCTC GTGATACTGT CATTAAAGCA ACGGTGTTTA GTTTGGTCTT GGATGCAGCA 
961 GACACAGTTG CTCTTCACAT AAATTGGGGA ATGGCATTAT TGATAAACAA TCAAAAGGCC 
1021 TTGACGAAAG CACAAGAAGA GATAGACACA AAAGTTGGTA AGGACAGATG GGTAGAAGAG 
1081 AGTGATATTA AGGATTTGGT ATACCTCCAA GCTATTGTTA AAGAAGTGTT ACGATTATAT 
1141 CCACCAGGAC CTTTGTTAGT ACCACACGAA AATGTAGAAG ATTGTGTTGT TAGTGGATAT 
1201 CACATTCCTA AAGGGACAAG ATTATTCGCA AACGTCATGA AACTGCAACG TGATCCTAAA 

12 61 CTCTGGTCTG ATCCTGATAC TTTCGATCCA GAGAGATTCA TTGCTACTGA TATTGACTTT 

13 21 CGTGGTCAGT ACTATAAGTA TATCCCGTTT GGTTCTGGAA GACGATCTTG TCCAGGGATG 
13 81 ACTTATGCAT TGCAAGTGGA ACACTTAACA ATGGCACATT TGATCCA.A.GG TTTCAATTAC 
1441 AGAACTCCAA ATGACGAGCC CTCGGATATG AAGGAAGGTG CAGGCATAAC TATACGTAAA 
1501 GTAAATCCTG TGGAACTGAT AATAGCGCCT CGCCTGGCAC CTGAGCTTTA TTAAAACCTA 
1561 AGATCATCTT GCTTG 



SEQ ID 373 

1 MLSPIEAIVG LVTFTFLFFF LWTKKSQKPS KPLPPKIPGG WPVIGHLFHF NDDGDDRPLA 
61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE AVKDCFSTND AIFSNRPAFL YGDYLGYNNA 
121 MLFLANYGPY WRKNRKLVIQ EVLSASRLEK FKHVRFARIQ ASIKNLYTRI DGNSSTINLT 
181 DWLEELNFGL IVKMIAGKNY ESGKGDEQVE RFKKAFKDFM ILSMEFVLWD AFPIPLFKWV 
241 DFQGHVKAMK RTFKDIDSVF QNWLEEHINK REKMEVNAEG NEQDFIDWL SKMSNEYLGE 
3 01 GYSRDTVIKA TVFSLVLD/VA DTVALHINWG MALLINNQKA LTKAQEEIDT KVGKDRWVEE 
3 61 SDIKDLVYLQ AIVKEVLRLY PPGPLLVPHE NVEDCWSGY HIPKGTRLFA NVMKLQRDPK 
421 LWSDPDTFDP ERFIATDIDF RGQYYKYIPF GSGRRSCPGM TYALQVEHLT MAHLIQGFNY 
481 RTPNDEPSDM KEGAGITIRK VNPVEIillAP RLAPELY 
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FIG. 163 



SBQ. ID. NO. 434 

D424-AA4 

1 GTTTTTCCCA TAGAAGCCTT TGTAGGACTA GTAACCTTCA CATTTCTCTC ATACTTCCTA 
61 TGGACAAAAA AATCTCAAAA ACTTCCAAAA CCCTTACCAC CGAAAATCCC CGGAGGATGG 
121 CCGGTAATCG GCCATCTTTT TCACTTCAAT AACGACGGCG ACGACCGTCC ATTAGCTCGA 
181 AAACTCGGAG ACTTAGCTGA TAAATACGGC CCCGTTTTCA CTTTTCGGCT AGGTCTTCCC 
241 CTTGTGCTAG TTGTAAGCAG TTACGAAGCT ATAAAAGATT GCTTCTCTAC AAATGACGCC 
301 ATTTTCTCCA ATCGTCCAGC TTTTCTTTAC GGCGAATACC TTGGCTACAA TAATACAATG 
3 61 CTTTTTCTAG CAAATTACGG ACCTTACTGG CGA 

ID* NO. 435 

AF5 

1 TAGTATACCT CCAAGCTATT GTTAAAAAGG TGTTACGATT ATATCCACCA GGACCTTTGT 
61 TAGTACCACA TGAAAATGTA AAGGATTGTG TTGTTAGTGG ATATCACATT CCTAAAGGGA 
121 CTAGATTATT CGCAAACGTC ATGAAACTGC AGCGCGATCC TAAATTCTTG TCAAATCCTG 
181 ATAAGTTCGA TCCAGAGAGA TTCATCGCTG GTGATATTGA CTTCCGTGGT CACCACTATG 

2 41 AGTTTATCCC ATTTGGTTCT GGAAGACGAT CTTGTCCGGG GATGACTTAT GCATCGCAAG 

3 01 TGGAACACCT AACAATGGCA CATTTAGTCC AGGGTTTCAA TTACAAAACT CCAAATGACG 
3 61 AGGCCTTGGA TATGAAGGAA GGTGCAGGCA TAACAATACG TAAGGTAAAT CCAGTGGAAT 
421 TGATAGTAAC GCCTCGCTTG GCACCTGAGC TTTACTAAAC CTAAGATCTT TCATCTTGG 

SBQ. ID. NO. 436 

1 VYLQAIVKKV LRLYPPGPLL VPHENVKDCV VSGYHIPKGT RIiFANVMKLQ RDPKFLSNPD 
61 KFDPERFIAG DIDFRGHHYE FIPFGSGRRS CPGMTYASQV EHLTMAHLVQ GFNYKTPNDE 
121 ALDMKEGAGI TIRKVNPVEL IVTPRLAPEL Y 

SBQ. ID. NO. 437 

D425-AA1 1 

1 GGATAGATGG GTAGAAGAGA GTGATATTAA GGATTTGGTG TACCTCCAAG CTATTGTCAA 
61 AGAAGTGTTA CGATTGTATC CACCAGGACC TTTGTTAGTA CCACATGAAA ATGTGGAGGA 
121 TTGTGTTGTT AGTGGATATC ACATTCCTAA AGGGACTAGA CTATTTGCGA ATGTCATGAA 
181 ACTGTAACGC GATCCTAAAC TCTGGCCAAA TCCTGATAAT TTCGATCCAG AGAGATTCAT 

2 41 CGCTGCAGAT ATTGACTTCC GTGGTCAGCA CTATGAGTAT ATCCCGTTTG GTTCTGGAAG 
301 ACGATCTTGT CCGGGGATGA CTTATGCATT GCAAGCGGAA CATCTAACAA TGGCACATTT 

3 61 GATCCAAGGT TTCAATTACA GGACTCCAAC TAACGAGCCC TTGGATATGA AGGAAGGTGC 
421 AGGCATAACT ATACGTAAGG TAAATCCTGT GGAAGTGCTA ATTAAGCCTC GCCTGGCACC 
481 CGAGCTTTAT TAAAACCTAA GATCATCTTG CT 

SBQ. ID. NO. 438 

1 DRWVEESDIK DLVYLQAIVK EVLRLYPPGP LLVPHENVED CWSGYHIPK GTRLFANVMK 
61 L.RDPKLWPN PDNFDPERFI AADIDFRGQH YEYIPFGSGR RSCPGMTYAL QAEHLTMAHL 
121 IQGFNYRTPT NEPLDMKEGA GITIRKVNPV EVLIKPRLAP ELY 

SBQ. ID. NO. 439 

D425-AF11 

1 AGGATAGATG GGTAGAAGAG AGTGATATTA AGGATTTGGT GTACCTCCAA GCTATTGTCA 

61 AAGAAGTGTT ACGATTGTAT CCACCAGGAC CTTTGTTAGT ACCACATGAA AATGTAGAGG 

121 ATTGTGTTGT TAGTGGATAT CACATTCCTA AAGGGACTAG ACTATTTGCG AATGTCATGA 

181 AACTGCAACG CGATCCTAAA CTCTGGCCAA ATCCTGATAA TTTCGATCCA GAGAGATTCG 

2 41 TCGCTGCAGA TATTGACTTC CGTGGTCAGC ACTATGAGTA TATCCCGTTT GGTTCTGGAA 

3 01 GACGATCTTG TCCGGGGGTG ACTTATGCAT TGCAAGTGGA ACATCACATT TGATCCAAGG 
3 61 TTTCAATTAC AGGACTCCAA CTAACGAGCC CTTGGATATG AAGGAAGGTG CACGCATAAC 
421 TATACGTAAG GTAAATCCTG TGGAAGTGCT AATTAAGCCT AGCCTGGCAC CTGAGCTTTA 



SBQ. 

D424- 
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481 TTAAAACCTA AGATCATCTT GCT 
SEQ* ID. NO. 440 

1 VYPVWFWKTI LSGGDLCIAS GTSHLIQGFN YRTPTNEPLD MKEGARITIR KVNPVEVLIK 
61 PSLAPELY 

SEQ. XD. NO. 441 

D425-AH10 

1 CTGAAATAGA GGGAGTATAA TATTCATTTC AAGAGATCAC TATAAAAAGG AAGTTCGTGA 
61 TAGTTTGATT CTCAAGTTCT TATCTAAAAA TCCATAATGG TTTTTCCCAT AGAAGCCATT 
121 GTAGGACTAG TAACCTTCAC ATTTCTCTTA TACTTCCTAT GGACAAAAAA ATCTCAAAAA 
181 CCTCCAAAAC CCTTACCACC GAAAATCCCC GGAGGATGGC CGGTAATCGG CCATCTTTTC 
241 TACTTCGATA ACGAAGGCGA CGACCGTCCA TTAGCTCGGA GACTTAGCTG ATAAATACGG 
3 01 CCCCGTTTTC ACTTTTCGGC TAGGTCTTCC CCTTGTGCTA GTTGTAAGCA GTTATGAAGC 
361 TATAAAAGAT TGCTTCTCTA CAAATGACGC CATTTTCTCC AATCGTCCAG 

SEQ. XD. NO. 442 

1 MVFPIEAIVG LVTFTFLLYP LWTKKSQKPP KPLPPKIPGG WPVIGHLFYF DNEGDDRPLA 

SEQ. XD. NO. 443 

D426-AA3 

1 GGACCTTTGT TAGTACCACA TGAAAATGTA AAGGATTGTG TTGTTAGTGG ATATCACATT 

61 CCTAAAGGGA CTAGATTATT CGCAAACGTC ATGAAACTGC AGCGCGATCC TAAATTCTTG 

121 TCAAATCCTG ATAAGTTCGA TCCAGAGAGA TTCATCGCTG GTGATATTGA CTTCCGTGGT 

181 CACCACTATG AGTTTATCCC ATTTGGTCCT GGAAGACGAT CTTGTCCGGG GATGACTTAT 

241 GCATTGCAAG TGGAACACCT AACAATGGCA CATTTAATCC AGGGTTTCAA TTACAAAACT 

3 01 CCAAATGACG AGCCCTTGGA TATGAAGGAA GGTGCAGGCA TAACTATACG TAAGGTAAAT 

3 61 CCTGTGGAAC TGATAATAGC GCCTCGCCTG GCACCTGAGC TTTATTAAAA CCTAAGATCA 

421 T 

SEQ. XD. NO. 444 

1 GPLLVPHENV KDCWSGYHI PKGTRLFANV MKLQRDPKFL SNPDKFDPER FIAGDIDFRG 
61 HHYEFIPFGP GRRSCPGMTY ALQVEHLTMA HLIQGFNYKT PNDEPLDMKE GAGITIRKVN 
121 PVELIIAPRL APELY 



SEQ. XD. NO. 445 

D426-AG1 

1 ATGACATTAT TGATAAACAA TCAAAATGCC TTGATGAAAG CACAAGAAGA GATAGACACA 
61 AAAGTTGGTA AGGATAGATG GGTAGAAGAG AGTGATATTA AGGATTTAGT ATACCTCCAA 
121 GCTATTGTTA AAAAGGTGTT ACGATTATAT CCACCAGGAC CTTTGTTAGT ACCACATGAA 
181 AATGTAAAGG ATTGTGTTGT TAGTGGATAT CACATTCCTA AAGGGACTAG ATTATTCGCA 
241 AACGTCATGA AACTGCAGCG CGATCCTAAA CTCTTGTCAA ATCCTGATAA GTTCGATCCA 
3 01 GAGAGATTCA TCGCTGGTGA TATTGACTTC CGTGGTCACC ACTATGAGTT TATCCCATTT 
3 61 GGTTCTGGAA GACGATCTTG TCCGGGGATG ACTTATGCAT TGCAAGTGGA ACACCTAACA 
421 ATGGCACATT TAATCCAGGG TTTCAATTAC AAGACTCCAA ATGGCGAGGC CTTGGATATG 
481 AAGGAAGGTG CAGGCATAAC AATACGTAAG GTAAATCCAG TGGAATTGAT AATAGCGCCT 
541 CGCCTGGCAC CTGAGCTTTA TTAATACCTA AGATCAT 

SEQ. ID. NO. 446 

1 MTLLINNQNA LMKAQEEIDT KVGKDRWVEE SDIKDLVYLQ AIVKKVLRLY PPGPLLVPHE 
61 NVKDCWSGY HIPKGTRLFA NVMKLQRDPK LLSNPDKFDP ERFIAGDIDF RGHHYEFIPF 
121 GSGRRSCPGM TYALQVEHLT MAHLIQGFNY KTPNGEALDM KEGAGITIRK VNPVELIIAP 
181 RLAPELY 
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SEQ* XD. NO. 447 

D427-AA6 

1 GGATATCACA TTCCTAAAGG GACTAGATTA TTCGCAAACG TCATGAAACT GCAGCGCGAT 
61 CCTAAACTCT TGTCAAATCC TGATAAGTTC GATCCAGAGA GATTCATCGC TGGTGATATT 
121 GACTTCCGTG GTCACCACTA TGAGTTTATC CCATTTGGTT CTGGAAGACG ATCTTGTCCG 
181 GGGATGACTT ATGCATTGCA AGTGGAACAC CTAACAATGG CACATTTAAT CCAGGGTTTC 
241 AATTACAAAA CTCCAAATGA CGAGGCCTTG GATATGAAGG AAGGTGCAGG CATAACAATA 
3 01 CGTAAGGTAA ATCCAGTGGA ATTGATAATA ACGCCTCGCT TGGCACCTGA GCTTTATTAA 
3 61 AACCTAAGAT CATCTTGCTT G 

SBQ. XD. NO. 448 

1 GYHIPKGTRL FANVMKLQRD PKLLSNPDKF DPERFIAGDI DFRGHHYEFI PFGSGRRSCP 
61 GMTYALQVEH LTMAHLIQGF NYKTPNDEAL DMKEGAGITI RKVNPVELII TPRLAPELY 

SEQ. XD. NO. 449 

D427-AB6 

1 CGATTATATC CACCGGGACC TTTATTAGTA CCCCATGAAA ATGTAGAGGA TTGTGTTGTT 
61 AGTGGATATC ACATTCCTAA AGGGACTAGA CTATTCGCGA ACGTTATGAA ATTACAGCGC 
121 GATCCTAAAC TCTGGTCAAA TCCTGATAAG TTCGATCCAG AGAGATTTTT CGCTGCTGAT 
181 ATTGACTTTC GTGGTCAACA CTATGAGTTT ATCCCATTTG GTTCTGGAAG ACGATCTTGT 
241 CCGGGGATGA CTTATGCTAT GCAAGTGGAA CACCTAACAA TCGCACACTT GATCCAGGGT 
301 TTCAATTACA AAACTCCAAA TGACGAGCCC TTGGATATGA AGGAAGGTGC AGGATTAACT 
3 61 ATACGTAAGG TAAATCCTAT AGAAGTGGTA ATTACGCCTC GCCTGACACC TGAGCTTTAT 
421 TAAAACCTAA GATCATCTTG CTTG 

SEQ. XD. NO. 450 

1 RliYPPGPLLV PHENVEDCW SGYHIPKGTR LFANVMKLQR DPKLWSNPDK FDPERFFAAD 
61 IDFRGQHYEF IPFGSGRRSC PGMTYAMQVE HLTIAHLIQG FNYKTPNDEP LDMKEGAGLT 
121 IRKVNPIEW ITPRLTPELY 

SEQ. XD. NO. 451 

D428-AC9 

1 ATATCACATT CCTAAAGGGA CTAGATTATT CGCAAACGTC ATGAAACTGC AGCGCGATCC 
61 TAAACTCTTG TCAAATCCTG ATAAGTTCGA TCCAGAGAGA TTCATCGCTG GTGATATTGA 
121 CTTCCGTGGT CACCACTATG AGTTTATCCC ATTTGGTTCT GGAAGACGAT CTTGTCCGGG 
181 GATGACTTAT GCATTGCAAG TGGAACACCT AACAATGGCA CATTTAATCC AGGGTTTCAA 
241 TTACAAAACT CCAAATGACG AGGCCTTGGA TATGAAGGAA GGTGCAGGCA TAACAATACG 
301 TAAGGTAAAT CCAGTGGAAT TGATAATAAC GCCTCGCTTG GCACCTGAGC TTTATTAAAA 
3 61 CCTAAGATCA TCTTGCTTG 

SEQ. XD. NO. 452 

1 YHIPKGTRLF ANVMKLQRDP KLLSNPDKFD PERFIAGDID FRGHHYEFIP FGSGRRSCPG 
61 MTYALQVEHL TMAHLIQGFN YKTPNDEALD MKEGAGITIR KVNPVELIIT PRLAPELY 

SEQ. XD. NO. 453 

D428-AH10 

1 GTGATACTGT CATAAAAGCA ACAGTGTTTA GTTTAGTCTT GGATGCTGCG GACACAGTTG 

61 CTCTTCACAT GAATTGGGGA ATGGCATTAT TGATAAACAA TCAACATGCC TTGAAGAAAG 

121 CGCAAGAAGA GATAGATAAA AAAGTTGGTA AGGATAGATG GGTAGAAGAG AGTGATATTA 

181 AGGATTTGGT ATACCTCCAA ACTATTGTTA AAGAAGTGTT ACGATTATAT CCACCGGGAC 

241 CTTTATTAGT ACCCCATGAA AATGTAGAGG ATTGTGTTGT TAGTGGATAT CACATTCCTA 

301 AAGGGACTAG ACTATTCGCG AACGTTATGA AATTACAGCG CGATCCTAAA CTCTGGTCAA 

361 ATCCTGATAA GTTCGATCCA GAGAGATTTT TCGCTGCTGA TATTGACTTT CGTGGTCAAC 

421 ACTATGAGTT TATCCCATTT 
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SBQ. ID. NO. 454 

1 DTVIKATVFS LVLDAADTVA LHMNWGMALL INNQHALKKA QEEIDKKVGK DRWVEESDIK 
61 DLVYLQTIVK EVLRLYPPGP LLVPHENVED CWSGYHIPK GTRLFANVMK LQRDPKLWSN 
121 PDKFDPERFF AADIDFRGQH YEFIPF 

SBQ. ID. NO. 455 

D429-AA1 

1 ATAATGCTTT CTCCCATAGA AGCCATTGTA GGAGCAGTAA CCCTAATTAC ATTTCTCTTA 
61 TACTTCCTAT GTACAAAAGA ATCTCAAAAA CATTCAAAGC CCTTACCAAC GAAAATCCCC 
121 GGAGGATGGC CGGTAATCGG CCATCTTTTC CACTTCAATA ACGACGGCGA CGACCGTCCA 
181 TTAGCTCGAA AACTCGGAGA CTTAGCTGAT AAATACGGCC CCGTTTTCAC TTTTCGGCTA 
241 GGTCTTCCCC TTGTGCTAGT TGTAAGCAGT TACGAAGCTT TAAAAGATTG CTTCTCTACA 
301 AATGACGCCA TTTTCTCCAA TCGTCCAGCT TTTCTTTACG GCGAATACCT TGGCTACAAT 
3 61 AATACAATGC TTTTTCTAGC AAATTA 

SBQ. ID. NO, 456 

1 MLSPIEAIVG AVTLITFLLY FLCTKESQKH SKPLPTKIPG GWPVIGHLFH FNNDGDDRPL 
61 ARKLGDLADK YGPVFTFRLG LPLVLWSSY EALKDCFSTN DAIFSNRPAF LYGEYLGYNN 
121 TMLFLAN 



SBQ. ID. NO. 457 

D430-AA3 

1 



AGAAAGCACA 
61 ATATTAAGGA 
121 CGGGACCTTT 
181 TTTCTAAAGG 
241 GGCCAAATCC 
3 01 GTCAGCATTA 
3 61 ATGCATTGCA 
421 CTCCAAATGA 
481 ATCCCGTGGA 
541 CTTTCATCTT 



AGAAGAGATA 
TTTGGTGTAC 
GTTAGTACCA 
GACTAGACTA 
TGATAATTTC 
TGAGTATATC 
AGTGGAACAC 
CGAGCCCTTG 
AGTGATAATT 
GG 



GACACAAAAG 
CTCCAAGCTA 
CATGAAAATA 
TTCGCAAATG 
GATCCAGAGA 
CCGTTTGGTT 
TTAACAATGG 
GATATGAAGG 
ATGCCTCGCC 



TTGGCAAGGA 
TTGTTAAAGA 
TAGAGGATTG 
TTATGAAACT 
GATTTGTCGC 
CTGGAAGACG 
CACATTTGAT 
AAGGTGCAGG 
TGGCACCTGA 



TAGATGGGTA 
AGTGTTACGA 
TGTTGTTAGT 
GCAGCGCGAT 
TGCAGGTATT 
ATCTTGTCCG 
CCAGGGTTTC 
TATAACTATA 
GCTTTATTAA 



GAAGAGAGTG 
TTATATCCAC 
GGATATTACA 
CCTAAACTCT 
GACTTTCGTG 
GGGATGACTT 
AATTACAGCA 
CGTAAGGTAA 
AACCTAAGAT 



SBQ. ID. NO. 458 

1 KAQEEIDTKV GKDRWVEESD IKDLVYLQAI VKEVLRLYPP GPLLVPHENI EDCWSGYYI 
61 SKGTRLFANV MKLQRDPKLW PNPDNFDPER FVAAGIDFRG QHYEYIPFGS GRRSCPGMTY 
121 ALQVEHLTMA HLIQGFNYST PNDEPLDMKE GAGITIRKVN PVEVIIMPRL APELY 



SBQ. ID. NO 

D431-AE6 

1 



459 



ATAATGCTTT 
61 TTCCTATGGA 
121 GGATGGCCGG 
181 GCTCGAAAAC 

2 41 CTTCCCCTTG 

3 01 GATGCCATTT 
3 61 GCCATGCTAT 
421 CAGGAAGTTC 
481 CAAACGAGCA 
541 ACTGATTGGT 
601 TATGATTCC 



CTCCCATAGA 
CAAAAAAATC 
TAATTGGCCA 
TCGGAGACTT 
TGTTAGTTGT 
TCTCTAATCG 
TTTTGGCAAA 
TCTCAGCTAG 
TTAAGAATTT 
TAGAAGAATT 



AGCCATTGTA 
TCAAAAACCT 
TCTTTTCCAC 
AGCTGACAAA 
AAGCAGTTAC 
TCCAGCTTTT 
TTACGGACCT 
TCGTCTCAAA 
ATACACTCGA 
GAATTTTGGT 



GGACTAGTAA 
TCAAAACCCT 
TTCAATGACG 
TACGGCCCCG 
GAAGCTATAA 
CTTTACGGCG 
TACTGGCGAA 
AAATTCAAAC 
ATTGATAGAA 
CTCATCGTGA 



CCTTCACATT 
TACCACCGAA 
ACGGCAACGA 
TTTTCACTTT 
AAGACTGTTT 
AATACCTTGG 
AAAATCGTAA 
ACGTGAGATT 
ATTCGAGTAC 
AGATGATAGC 



TCTCTTCTAC 
AATCCCCGGA 
CCGTCCATTA 
TCGGCTAGGC 
CTCTACAAAT 
CTACAATAAT 
ATTAGTTATT 
CGCCAGAATT 
GATAAATTTA 
TGGGAAAAAT 



SBQ. ID. NO. 460 

1 MLSPIEAIVG LVTFTFLFYF LWTKKSQKPS 

61 RKLGDLADKY GPVFTFRLGL PLVLWSSYE 

121 MLFLANYGPY WRKNRKLVIQ EVLSASRLKK 

181 DWLEEIiNFGL IVKMIAGKNY DS 



KPLPPKIPGG WPVIGHLFHF NDDGNDRPLA 
AIKDCFSTND AIFSNRPAFL YGEYLGYNNA 
FKHVRFARIQ TSIKNLYTRI DRNSSTINLT 
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SBQ. 10. NO. 461 

D113-AE9 

1 TTGATTTACA TGTTTGGTCA AGAACATAAA GACCATCGCC GTCGAATGGC CCCTAATTTT 

61 ACCCCTAAAG CTCTTGCTAC TTACACTGTT ATCCAACAAA AGATTATTAT CAAACACTTT 

121 CAGTCCTGGT TGGACGAAGC ATCCCAATCC CCTAACAAAC CAATCACACT TCGCCTCCTT 

181 TGCCGCGATA TGAACTTGGA TACTTCCCAG ACTGTCTTCG TTGGTCCATA CTTAAACGAA 

241 GATTCCAGAA AGCAGTTCAA TGTTGACTAC AATTACTTCA ATGTTGGGTT AATGAAACTC 

3 01 CCTGTTGATT TACCTGGTTT CGCCTTCAGA GATGCTAGGT TAGCTGTTGG GAGACTAGTC 

3 61 GACACGCTTT CCGTTTGTGC AGCACAGAGC CAAATTAAGA TGCGAGGTGA CGAAGAACCC 

421 ACTTGCTTGA TTGATTTTTG GATGCAGGAG AATTTCAGAG AGATTCAGGA AGCTAAGATT 

481 AATGGTTCAC AAAAGCCGTT TGAGTATACC AGTAAGCAAC TTGGTGGCTT CTTATTTGAC 

541 TTCCTCTTTG CGGCTCAAGA TGCTTCTACT TCATCTCTGT TATGGGCAAT GGTGCTTTTG 

6 01 GAATCTCACC CGCAAGTTCT GGAGAGAGTC CGAGCCGAAG TGGCGAAATT CTGGTCGCCA 
661 GAATCTGAGG AGCAGCCGTT GACGGCGGAG ATGCTTAGGG AAATGAAGTA CCTGGAGGCG 
721 GTGGCGCGTG AGGTTGTTAG GATCAGAACT CCGGCGGCTT TGGTGCCGCA CATTGCCGGC 

7 81 GAAGAATTCC GGTTAACTGA TGATTATGTT ATTCCAAAGG GGACTATTGT TTTCCCTTCG 
841 GTTTTTGACT CGTCTTTCCA GGGGTTTCCT GAACCGGAGA AGTTTGACCC GGACCGGTTC 
901 ACAGAGGAGA GGCAAGAGGA ACGGGTTTAC AAGAAGAATT ATCTAGCATT TGGAGCTGGG 
961 CCCCATGGAT GTGTGGGACA GAGGTATGCT ATAAACCATT TGATGCTCTT TATTGCGTTG 

1021 TTCACGGCTC TGATTGATTT CAAGAGGCAC AAAACGGACG GCTGTGATGA TATCGCGTAT 

1081 ATTCCAACCA TTGCTCCAAA GGATGATTGT AAAGTGTTCC TTTCACAGAG GTGCACTCGA 

1141 TTCCCATCTT TTTCATGAAC TAATTGCACC TTTTATTTAA TTCTGATCCT CAAATTGGTC 

1201 CCATTGGACC ATGGATGTAA TAGGACCAAT TGCAAGAATG GGGTCCAATG TATTTGTTCT 

12 61 TTCCATTTAT TTTTTTT 

SEQ. ID* NO. 462 

1 LIYMFGQEHK DHRRRMAPNF TPKALATYTV IQQKIIIKHF QSWLDEASQS PNKPITLRLL 

61 CRDMNLDTSQ TVFVGPYLNE DSRKQFNVDY NYFNVGLMKL PVDLPGFAFR DARLAVGRLV 

121 DTLSVCAAQS QIKMRGDEEP TCLIDFWMQE NFREIQEAKI NGSQKPFEYT SKQLGGFLFD 

181 FLFAAQDAST SSLLWAMVLL ESHPQVLERV RAEVAKFWSP ESEEQPLTAE MLREMKYLEA 

241 VAREWRIRT PAALVPHIAG EEFRLTDDYV IPKGTIVFPS VFDSSFQGFP EPEKFDPDRF 

3 01 TEERQEERVY KKNYLAFGAG PHGCVGQRYA INHLMLFIAL FTALIDFKRH KTDGCDDIAY 

3 61 IPTIAPKDDC KVFLSQRCTR FPSFS 

SEQ. ID. NO. 463 

D114-AE12 

1 TCATTTTCAG TATGCTATGT TTTGTCTACT TGTGTTGATG TGTTTCGGGG ATAAGCTCGA 
61 CGAGGCTCAA ATCAAACAAA TTGAAGGTGT TCAGCGCAGG TTGCTCCTGG GTTTCCGACG 
121 ATTCAATATA CTCAATTTCT TACCCAGAGT TGGGAAAATA ATCTTTAGGA ATCGCTGGAA 
181 GGAACTAATT GAACTACGTC AAGAGCAAGA GAAGATCTTC ATTCCTTTGA TTGAAGCTCG 
241 AAGTATTAGG GCCAAAGAAC AAAAGCCTGA GGAAGAAGTG GTGGCTTATG TGGATACGCT 
3 01 GTTGAATTTG GAATTGCCAG GGGAAAACAG GAGCCTCAAT TATGGGGAAA TGGTTAGCCT 
3 61 CTGCAGTGAA TTCCTAAACG CCGGAACTGA TACGACGTCC ACCGCCTTAC AGTGGGTTAT 
421 GGCCAACTTG GTCAAAAACC CTTCCATTCA GGGAAAACTA TATCAAGAAA TTGCTAGTGT 
481 AGTGGGAGAG AAACAGAGCA AGTTGACAGA AGAGGTGGTA AAGGAGGACG ATCTGCATAA 
541 AATGCCATAC TTGAGAGCAG TGATCTTTCT TAGGCGACAC CCGCCTGGTC ACTTTGTGCT 
6 01 GCCACATACG GTGACAGAGG AAGTAGAACT GAATGGCTAT GTTGTCCCGA AGAATGCCAC 
661 CATCAATTTC ATGGTTGCAG ACATGGGTTT GGACCCAAAG GTGTGGGAGG ATCCCTTGGA 
721 ATTCAAGCCA GAGAGGTTCT TAATGGAGGG ATCAGATAAG GAAGGGTTCG ATATAACAGG 
781 AAGTAGAGAG ATCAAGATGA TGCCATTTGG CGCTGGTAGG AGAATTTGCC CAGGCTATGC 
841 TTTGGCTATG CTTCATCTAG AGTACTTTGT GGCTAATTTG GTTTGGCATT TTCGATGGGA 
901 GGCTGTGGAG GGAGATGATG TTGATCTTTC AGAAAAGCTA GAATTCACCG TTGTGATGAA 
961 GAATCCACTT CGAGCTCGTA TCTGCCTCAG AGTTAACTCT ATTTGAATTT GGTAATTACT 
1021 AGTTCTTTCT ATTTGCATTG TTCCCTGTTG ATGGACTTCC CCCATATAGT ACTGGAAGTT 
1081 AGAGGGAGAA TGATTATTAA TGCCTTGCTG CAATATTAGC TTAGTAGTTA GTAGTGAATA 
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1141 TAATTGAAAC TGGATATTTC TATCTTATGT GTTGTACATT TGGTTCATTG CAAAGGGCGA 
1201 ATTC 



SEQ. ID. NO. 464 

1 HFQYAMFCLL VLMCFGDKLD EAQIKQIEGV QRRLLLGFRR FNILNFLPRV GKIIFRNRWK 
61 ELIELRQEQE KIFIPLIEAR SIRAKEQKPE EEWAYVDTL LNLELPGENR SLNYGEMVSL 
121 CSEFLNAGTD TTSTALQWVM ANLVKNPSIQ GKLYQEIASV VGEKQSKLTE EWKEDDLHK 
181 MPYLRAVIFL RRHPPGHFVL PHTVTEEVEL NGYWPKNAT INFMVADMGL DPKVWEDPLE 
241 FKPERFLMEG SDKEGFDITG SREIKMMPFG AGRRICPGYA LAMLHLEYFV ANLVWHFRWE 
3 01 AVEGDDVDLS EKLEFTWMK NPLRARICCR VNSI 



SEQ. ID. NO. 465 

D119-AC3 

1 ATAAATCTAA CTGATTGGTT AGAAGAATTG AATTTTGGTC TGATCGTGAA AATGATCGCT 
61 GGGAAAAATT ATGAATCCGG TAAAGGAGAT GAACAAGTGG AAAGATTTAA GAATGCGTTT 
121 AAGGATTTCA TGGTTTTATC AATGGAATTT GTATTATGGG ATGCATTTCC AATTCCATTA 
181 TTTAAATGGG TGGATTTTCA AGGTCATATT AAGGCAATGA AAAGGACATT TAAGGATATA 
241 GATTCTGTTT TTCAGAACTG GTTAGAGGAA CATATTAATA AAAGAGAAAA AATGGAGGTT 
3 01 AATGCAGAAG GGAATGAACA AGATTTCATT GATGTGGTGC TTTCAAAAAT GAGTAATGAA 
3 61 TATCTTGGTG AAGGTTACTC TCGTGATACT GTCATTAAAG CAACGGTGTT TAGTTTGGTC 
421 TTGGATGCAG CAGACACAGT TGCTCTTCAC ATAAATTGGG GAATGGCATT ATTGATAAAC 
481 AATCAAAAGG CCTTGACGAA AGCACAAGAA GAGATAGACA CAAAAGTTGG TAAGGACAGA 
541 TGGGTAGAAG AGAGTGATAT TAAGGATTTG GTATACCTCC AAGCTATTGT TAAAGAAGTG 
601 TTACGATTAT ATCCACCAGG ACCTTTGTTA GTACCACACG AAAATGTAGA AGATTGTGTT 
661 GTTAGTGGAT ATCACATTCC TAAAGGGACA AGATTATTCG CAAACGTCAT GAAACTGCAA 
721 CGTGATCCTA AACTCTGGTC TGATCCTGAT ACTTTCGATC CAGAGAGATT CATTGCTACT 
7 81 GATATTGACT TTCGTGGTCA GTACTATAAG TATATCCCGT TTGGTTCTGG AAGACGATCT 
841 TGTCCAGGGA TGACTTATGC ATTGCAAGTG GAACACTTAA CAATGGCACA TTTGATCCAA 
901 GGTTTCAATT ACAGAACTCC AAATGACGAG CCCTTGGATA TGAAGGAAGG TGCAGGCATA 
961 ACTATACGTA AGGTAAATCC TGTGGAACTG ATAATAACGC CTCGCCTGGC ACCTGAGCTT 
1021 TATTAAAACC TAAGATGTTT CATCTTGGTT GATCATTGTT TAATAC 

SEQ. ID. NO. 456 

1 INLTDWLEEL NFGLIVKMIA GKNYESGKGD EQVERFKNAF KDFMVLSMEF VLWDAFPIPL 
61 FKWVDFQGHI KAMKRTFKDI DSVFQNWLEE HINKREKMEV NAEGNEQDFI DWLSKMSNE 
121 YLGEGYSRDT VIKATVFSLV LDAADTVALH INWGMALLIN NQKALTKAQE EIDTKVGKDR 
181 WVEESDIKDL VYLQAIVKEV LRLYPPGPLL VPHENVEDCV VSGYHIPKGT RLFANVMKLQ 
241 RDPKLWSDPD TFDPERFIAT DIDFRGQYYK YIPFGSGRRS CPGMTYALQV EHLTMAHLIQ 
301 GFNYRTPNDE PLDMKEGAGI TIRKVNPVEL IITPRLAPEL Y 



SEQ. ID. NO. 467 

D132-AA5 

1 ATGAGAATGG TAGCTGGAAA GAGATATTAT GGTGAAGAGG TAGATAACGA GGAGGCAAAC 
61 CATTTTCGGG AGCTTGTAGA AGAGGTTATT TCATATGGGG GTGCATCAAA TCCCACGGAT 
121 TTCATGCCTG CAATATTTCG TTGCTTTTTC AGGAGTATGG AGAAGAATTT GGCCAGGCTT 
181 GGTAGCAAAA TGGACGCGCT CTTGCAAGGC TTGATTGATG AACACCGTCG TGATAAAAGC 

2 41 AGAAATTCCA TGATTGATCA TTTGCTTTCT CTGCAAGAAT CAGAACCAGA ATATTACTCT 

3 01 GATCAAATCA TCAAAGGAAT AATATTGGTC ATGCTGAATG CGGGGACTGA AACATCATCT 
3 61 GTAACAATAG AATGGGCAAT GTCTCTTTTA CTCAATCATC CAGAGGTGTT GGAAAAGGCC 
421 AAAGCTGAAA TAGACGACCA TGTGGGTAAA GATCGTTTAG TGGATGAAGC AGATTTACCC 
481 AAGCTGAAAT ACCTTCAAAG TATTATTTCA GAGACACTTC GATTGTACCC TGCAGGACCA 
541 ATGCTAGTGC CTCATGGATC ATCTGATGAT TGCACTATCG CTGGGTTGCA CATTCCACGT 
601 GGCACGATGC TATTGGTGAA TGCTTGGGCC ATCCACAGGG ACCCATTACT TTGGGAGGAT 
661 CCAGAGAGCT TCAAGCCAGA AAGGTTTGAA GGTGTGCAAG TGGAATCATG GAAGCTATTG 
721 CCATTTGGAA TGGGAAGGAG AGCGTGTCCA GGTTCTGGAC TTGCTCAACG TGTGGTTGGT 
781 TTAGCTTTAG CATCTCTAGT GCAGTGTTTT GAGTGGAAAA GGGTAAGTGA AGAGGTGGTT 
841 GATTTGACTG AAGGAAAAGG TCTCACCATG CCAAAAGCTG AGCCACTCAT GGCTAGGTGC 
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901 GAAGCTCGTG ATATTCTTCA CAAAGTTGTT TCAGAAATAT CCTAACGTTT CAGAGTGTTT 
961 CTTGCATTTT TTTAGTGCTC CATACCTCTA GTTT 

SEQ. ID* NO. 468 

1 MRMVAGKRYY GEEVDNEEAN HFRELVEEVI SYGGASNPTD FMPAIFRCFF RSMEKNLARL 

61 GSKMDALLQG LIDEHRRDKS RNSMIDHLLS LQESEPEYYS DQIIKGIILV MLNAGTETSS 

121 VTIEWAMSLL LNHPEVLEKA KAEIDDHVGK DRLVDEADLP KLKYLQSIIS ETLRLYPAGP 

181 MLVPHGSSDD CTIAGLHIPR GTMLLVNAWA IHRDPLLWED PESFKPERFE GVQVESWKLL 
241 PFGMGRRACP GSGLAQRWG LALASLVQCF EWKRVSEEW DLTEGKGLTM PKAEPLMARC 
3 01 EARDILHKW SEIS 

SSQ. ID. NO. 469 

D223-BB10 

1 CTCCTTTGCC GCGATATGAA CTTGGATACT TCCCAGACTG TCTTCGTTGG TCCATACTTA 
61 AACGAAGATT CCAGAAAGCA GTTCAATGTT GACTACAATT ACTTCAATGT TGGGTTAATG 
121 AAACTCCCTG TTGATTTACC TGGTTTCGCC TTCAGAGATG CTAGGTTAGC TGTTGGGAGA 
181 CTAGTCGACA CGCTTTCCGT TTGTGCAGCA CAGAGCCAAA TTAAGATGCG AGGTGACGAA 
241 GAACCCACTT GCTTGATTGA TTTTTGGATG CAGGAGAATT TCAGAGAGAT TCAGGAAGCT 
3 01 AAGATTAATG GTTCACAAAA GCCGTTTGAG TATACCAGTA AGCAACTTGG TGGCTTCTTA 
3 61 TTTGACTTCC TCTTTGCGGC TCAAGATGCT TCTACTTCAT CTCTGTTATG GGCAATGGTG 
421 CTTTTGGAAT CTCACCCGCA AGTTCTGGAG AGAGTCCGAG CCGAAGTGGC GAAATTCTGG 
481 TCGCCAGAAT CTGAGCAGCC GTTGACGGCG GAGATGCTTA GGGAAATGAA GTACCTGGAG 
541 GCGGTGGCGC GTGAGGTTGT TAGGATCAGA ACTCCGGCGA CTTTGGTGCC GCACATTGCC 
601 GGCGAAGAAT TCCGGTTAAC TGATGATTAT GTTATTCCAA AGGGGACTAT TGTTTTCCCT 
661 TCGGTTTTTG ACTCGTCTTT CCAGGGGTTT CCTGAACCGG AGAAGTTTGA CCCGGACCGG 
721 TTCACAGAGG AGAGGCAAGA GGAACGGGTT TACAAGAAGA ATTATCTAGC ATTTGGAGCT 
781 GGGCCCCATG GATGTGTGGG ACAGAGGTAT GCTATAAACC ATTTGATGCT CTTTATTGCG 
841 TTGTTCACGG CTCTGATTGA TTTCAAGAGG CACAAAACGG ACGGCTGTGA TGATATCGCG 
901 TATATTCCAA CCATTGCTCC AAAGGATGAT TGTAAAGTGT TCCTTTCACA GAGGTGCACT 
961 CGATTCCCAT CTTTTTCATG AACTAATTGC GCCTTTTATT TAATTCTGAT CCTCAAATTG 
1021 GTCCCATTGG 

SBQ. ID. NO. 470 

1 LLCRDMNLDT SQTVFVGPYL NEDSRKQFNV DYNYFNVGLM KLPVDLPGFA FRDARLAVGR 
61 LVDTLSVCAA QSQIKMRGDE EPTCLIDFWM QENFREIQEA KINGSQKPFE YTSKQLGGFL 
121 FDFLFAAQDA STSSLLWAMV LLESHPQVLE RVRAEVAKFW SPESEQPLTA EMLREMKYTiE 
181 AVAREWRIR TPATLVPHIA GEEFRLTDDY VIPKGTIVFP SVFDSSFQGF PEPEKFDPDR 
241 FTEERQEERV YKKNYLAFGA GPHGCVGQRY AINHLMLFIA LFTALIDFKR HKTDGCDDIA 
3 01 YIPTIAPKDD CKVFLSQRCT RFPSFS 

SEQ. ID. NO. 471 

D245-AA8 

1 GATCGTTTAG TGGATGAAGC AGATTTACCC AAGCTGAAAT ACCTTCAAAG TATTATTTCA 
61 GAGACACTTC GATTGTACCN TGCAGCACCA ATGCTAGTGC CTCATGAATC ATCTGATGAT 
121 TGCACGGTCG CTGGCTTCCA CATTCCTCGT GGCACGATGC TATTGGTGAA TGCTTGGGCC 
181 ATCCACAGGG ACCCCTTACT TTGGGAGGAC CCAGAGAGCT TCAAGCCAAA AAGGTTTGAA 
241 GGTGTGCAAG CCGAATCATG GAAGCTATTG CCATTTGGAA TGGGAAGGAG AGCGTGCCCA 
3 01 GGTTCTGGAC TTGCTCAATG TGTGGTTGGT TTAGCTTTAG CAACTCTAGT GC AGTGTTTT 
361 GAGTGGAAAA GGGTAAGCGA AGAGGTGGTT GATTTGACGG AAGGAAAAGG TCTCACTATG 
421 CCAAAACCCG AGCCACTCAT GGCTAGGTGC GAAGCTCGTG ACATTTTTCA CAAAGTTGTT 
481 TCAGAAATAT CTTAATGTTT TGGGAGTCTG AATTAATAAT GTAAAATGTA TTTTCATTTT 
541 CTCATATAAT ATTGCACTAT CTACATTTCT GATATGTCAT TGAGATACTC CGG 

SEQ. ID. NO. 472 

1 DRLVDEADLP KLKYLQSIIS ETLRLYXAAP MLVPHESSDD CTVAGFHIPR GTMLLVNAWA 
61 IHRDPLLWED PESFKPKRFE GVQAESWKLL PFGMGRRACP GSGLAQCWG LALATLVQCF 
121 EWKRVSEEW DLTEGKGLTM PKPEPLMARC EARDIFHKVL SEIS 
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SBQ. ID. NO. 473 

D246-AE12 

1 GAGGACGATC TGCATAAAAT GCCATACTTG AGAGCAGTGA TCTTAGAGGG TCTTAGGCGA 
61 CACCCGCCTG GTCACTTTGT GCTGCCACAT ACGGTGACAG AGGAAGTAGA ACTGAATGGC 
121 TATGTTGTCC CGAAGAATGC CACCATCAAT TTCATGGTTG CAGACATGGG TTTGGACCCA 
181 AAGGTGTGGG AGGATCCCTT GGAATTCAAG CCAGAGAGGT TCTTAATGGA GGGATCAGAT 
241 AAGGAAGGGT TCGATATAAC AGGAAGTAGA GAGATCAAGA TGATGCCATT TGGCGCTGGT 
301 AGGAGAATTT GCCCAGGCTA TGCTTTGGCT ATGCTTCATC TAGAGTACTT TGTGGCTAAT 
3 61 TTGGTTTGGC ATTTTCGATG GGAGGCTGTG GAGGGAGATG ATGTTGATCT TTCAGAAAAG 
421 CTAGAATTCA CCGTTGTGAT GAAGAATCCA CTTCGAGCTC GTATCTGCCT CAGAGTTAAC 
481 TCTATTTGAA TTTGGTAATT ACTAGTTCTT TCTATTTGCA TTGTTCCCTG TTGATGGACT 
541 TCCCCCATAT AGTACTGGAA GTTAGAGGGA G 



SEQ. ID. NO. 474 

1 EDDLHKMPYL RAVILEGLRR HPPGHFVLPH TVTEEVELNG YWPKNATIN FMVADMGLDP 
61 KVWEDPLEFK PERFLMEGSD KEGFDITGSR EIKMMPFGAG RRICPGYALA MLHLEYFVAN 
121 LVWHFRWEAV EGDDVDLSEK LEFTWMKNP LRARICLRVN SI 



SEQ. ID. NO. 475 

D279-AD1 

1 GCAGTGCTGA CTTGGCGTAC GAAGCTTTAA TTGAAAAGGG TCAAATTTTC GCTACCCGCC 
61 CGAGTGAGAC CCCGACCCGA ACCATTTTCA GTTGCAACAA GTTCAGTGTC AATGCAGCAA 
121 TTTACGGGCC ACTTTGGCGA TCCCTGAGAA GAAATATGGT CCAAAATATG CTTAGTTCGA 
181 GTAGAGTGAA AGAATTTCGT GATTGTAGAG AAATGGCAAT GGATAAGTTG ATTGAAAGGA 
241 TACATGCAGA TGCTAAGGCC AATAACGATA TCGTTTGGGT GCTGAAAAAC GCGCGTTTCG 
3 01 CGGTGTTTTA TATACTTTTG ACTATGTGTT TTGGTGTTGA AATGGATGAG AAAATGATTG 
361 AAACCGTTGA TCAAATGATG AAGGATGTGC TAATGGAGCT TCATCCAAGA ATAGATGATT 
421 TTCTTCCTAT ATTGAGCTTG TTTGTTGGTT ATAAACAACG TAAGAGAGTT CACGAAGTTC 
481 GCAAGAGACA AATAGAAACA CTTGTCCCTT TGATCGAGAA ACGTCGAAGA GCAATACAAA 
541 ATCCTGGGTA CGATAAGACA GTTGCCTCGT TTTCGTACTT AGATACTTTA TTTGATGTTA 
601 AGGTCGAAGG TACAAAGTCA GGACCAACGA ATCCGGAGCT TGTAACACTA TGTTCAGAGT 
661 TCTTGAATGG TGGGACCGAC ACAACCGCTA CCGCAATAGA GTGGGCCATA GGGAGAATGA 
721 TCGAAAATCC AAGCATACAG AAGAGAATAT ACGAAGAGAT TAGAAATACA GTGGGTGACA 
7 81 GAAAGATTGA CGAAAAGGAT ATGGATAAGA TGCCTTATTT AAACGCCGTT GTAAAGGAGC 
841 TTTTACGTAA ACATCCTCCT ACGTACGTTA CATTTACCCA TGCAGTAACG GAGCCAACAA 
9 01 CATTGGGTGG GTATGACATA CCCACATATG CTAATGTAGA GTTTTTTGTA CCCGGGATCT 
961 CGGATGACCC GAAAGTTTGG TCTGATCCGG AAAAGTTTGA CCCGGATAGG TTTCTATCCG 
1021 GGCGGGAGGA CGCTGATATA ACGGGTGTGA CCGGGGTAAA GATGATGCCA TTTGGGGTCG 
1081 GGCGGAGGAT TTGTCCGGGC TTGGGCTTGG CAACGGTGCA TGTGAATTTG ATGTTGGCCC 
1141 GAATGATTCA AGAATTTGAA TGGTCCGCTT ACCCGGAAAA TAGGAAAGTG GATTTTACTG 
12 01 AGAAATTGGA ATTTACTGTG GTGATGAAAA ACCCTTTAAG AGCTAAGGTC AAGCCAAGAA 
1261 TGCAAGTGGT GTAATTCATT AAGATTATAA GTCC 



SEQ. ID. NO. 476 

1 SADLAYEALI EKGQIFATRP SETPTRTIFS 
61 RVKEFRDCRE MAMDKLIERI HADAKANNDI 
121 TVDQMMKDVL MELHPRIDDF LPILSLFVGY 
181 PGYDKTVASF SYLDTLFDVK VEGTKSGPTN 
241 ENPSIQKRIY EEIRWTVGDR KIDEKDMDKM 
3 01 LGGYDIPTYA NVEFFVPGIS DDPKVWSDPE 
3 61 RRIGPGLGLA TVHVNLMLAR MIQEFEWSAY 
421 QW 



CNKFSVNAAI YGPLWRSLRR NMVQNMLSSS 
VWVLKNARFA VFYILLTMCF GVEMDEKMIE 
KQRKRVHEVR KRQIETLVPL lEKRRRAIQN 
PELVTLCSEF LNGGTDTTAT AIEWAIGRMI 
PYLNAWKEL LRKHPPTYVT FTHAVTEPTT 
KFDPDRFLSG REDADITGVT GVKMMPFGVG 
PENRKVDFTE KLEFTWMKN PLRAKVKPRM 
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SBQ. ID. NO. 477 

D282-AA10 

1 ACAGCATCTT GTGCTGCCAT AATAATTACT CTAGTGGTGT GTATATGGAG AGTGCTGAAT 
61 TGGGTTTGGT TCAGACCAAA GAAGCTGGAA AAGCTACTGA GGAAACAAGG TCTCAAAGGC 
121 AATTCCTACA GGATTTTGTA TGGGGATATG AAGGAGCTTT CTGGTATGAT TAAGGAAGCT 
181 AACTCCAAAC CCATGAATCT TTCTGATGAT ATTGCCCCAA GATTGGTCCC TTTCTTTCTT 
241 GATACCATCA AGAAATATGG GAAAAAATCC TTTGTATGGT TGGGTCCAAA ACCGCTGGTT 
3 01 TTTGTCATGG ACCCCGAGCT TATAAAGGAA GTATTCTCCA AAAACTATCT GTATCAAAAG 
3 61 CCTCATTCAA ATCCATTAAC CAAGTTACTG GCACAAGGAC TTGTAAGCCA AGAGGAAGAC 
421 AAATGGGCCA AACATAGAAA AATCGTCACT CCTGCCTTCC ACCTGGAGAA GCTAAAGGTT 
481 TGTAATTGCA ATGGATTCTC CGTAAGCAGT TCATTTTTAT CTTTTCTTTC ATCGTAACAG 
541 AAACTTACTC TTGATTGTGA TTCTTTTTTG AGGTACTTGT TTGGAACCAT TTTGTATTCC 
601 TAATTACAAA GTCGGAACAT TTTGAATAAG TGTCCCTTCA TTGACTTTCT CTTAACCATT 
661 TCTTAAACAG CATATGCTTC CAGCTTTTTG TTTGAGCTGT ACTGAGATGC TGAGCAAATG 
721 GGAAGACATT GTTGCAGTTG AGGGCTCACA TGAGATAGAT ATATGGCCTG GCCTTCAACA 
781 ATTAACTAGT GATGTGATCT CTCGGACAGC CTTTGGCAGT AGCTATGAAG CAGGTAGAAG 
841 GATATTTGAA CTTCAAAAGG AACAAGCTCA ATTTCTTATG GAAGCTATAC GCTCCGTTTA 
901 TATTCCAGGC TGGAGGTTTT TGCCAACAAA GAGGAACAGA AGAATGAAGG AAATTGAAAA 
961 GGATGTTCAA GCCTTAGTTA AAGGTATTAT TGATAAAAGA GTAAAGTCAA TGAAAGCAGG 
1021 AGAGGTGAAT AATGAGGATC TGCTTGGTAT ATTGCTGGAA TCTAATTTTA AAGAAATTGA 
1081 ACAGCATGGA AACAAGGATT TTGGAATGAG CATTGAAGAA GTCATTCAAG AATGCAAGTT 
1141 ATTCTATTTT GCTGGCCAAG AAACTACATC AGTGTTGCTT GTATGGACTC TAATATTGCT 
12 01 GAGCAGGCAT CAGGATTGGC AAGCACTGGC CAGAGAAGAG GTGTTGCAAG TCTTTGGGAA 

12 61 TCAGAAACCA GATTTTGATG GATTAAATCG TCTAAAAATT GTTACAATGA TCTTGTACGA 

13 21 GTCTTTAAGG CTCTATCCCC CAGTAGTGAC ACTTACCCGA AGGCCTAAGG AAGACACTGT 
1381 ATTAGGAGAT GTATCTCTAC CAGCAGGTGT GTTAATCTCC TTACCAGTGA TCTTATTGCA 
1441 TCACGACGAA GAGATATGGG GTAAAGATGC AAAGAAGTTC AAGCCAGAGA GATTCAGAGA 
1501 TGGAGTCTCA AGTGCAACAA AGGGTCAAGT CACTTTTTTC CCATTTACTT GGGGTCCCAG 
1561 AATATGCATT GGACAAAATT TTGCCATGTT AGAAGCAAAG ACTACTTTGG CTATGATCCT 
1621 ACAACGCTTC TCCTTTGAAC TGTCTCCATC TTATGCACAT GCTCCTCAGT CCATAATAAC 
1681 TTTGCAACCC CAGTATGGTG CTCCACTTAT TTTGCATAAA ATATAGTTTA TTACTTGTAA 
1741 GTAGTGTCTC GTTTTATGTT AAGCATGAGT CCAAAATGTT AAGGCTTGTA GAACTGCAAA 
1801 ATGGGAATGA ATCACTAGTG AATTC 

SEQ. XD. NO. 478 

1 KQHMLPAFCL SCTEMLSKWE DIVAVEGSHE IDIWPGLQQL TSDVISRTAF GSSYEAGRRI 
61 FELQKEQAQF LMEAIRSVYI PGWRFLPTKR NRRMKEIEKD VQALVKGIID KRVKSMKAGE 
121 VNNEDLLGIL LESNFKEIEQ HGNKDFGMSI EEVIQECKLF YFAGQETTSV LLVWTLILLS 
181 RHQDWQALAR EEVLQVFGNQ KPDFDGLNRL KIVTMILYES LRLYPPWTL TRRPKEDTVL 

2 41 GDVSLPAGVIi ISLPVILLHH DEEIWGKDAK KFKPERFRDG VSSATKGQVT FFPFTWGPRI 

3 01 CIGQNFAMLE AKTTLAMILQ RFSFELSPSY AHAPQSIITL QPQYGAPLIL HKI 

SEQ. ID. NO. 479 

D295-AA1 

1 CATCAACGAG CACAAGAAAA ATCTTGCAGC TGGCAAGAGT AATGGTGCAT TAGGAGGTGA 
61 AGATCTAATT GATGTCCTAC TAAGACTTAA GAATGATACA AGTCTTCAAT TTCCCATCAC 
121 CAACAACAGT ATCAAAGCTG TTATTGTTGA CATGTTTGCT GCCGGAACGG AAACTTCATC 
181 AACAACAACT GTATGGGCTA TGGCTGAAAT GATGAAGAAT CCAAGTGTAT TCACCAAAGC 
241 TCAAGCAGAA GTGCGAGAAG CCTTTAGGGA CAAAGTATCT TTTGATGAAA ACGATGTGGA 
301 GGAGCTGAAA TACTTAAAGT TAGTCATTAA AGAAACTTTG AGACTTCATC CACCCTCTCC 
3 61 ACTTTTGGTC CCAAGAGAAT GCAGGGAAGA CACAGATATA AACGGCTACA CTATTCCTGC 
421 GAAGACCAAA GTTATGGTTA ATGTTTGGGC ATTGGGAAGA GATCCAAAAT ATTGGGATGA 
481 CGCCGAAAGC TTTAAGCCAG AGAGATTTGA GCAATGCTCT ATGGATTTTT TTGGTAATAA 
541 TTTTGAGTTT CTTCCCTTTG GTGGTGGACG GAGAATTTGT CCTGGAATGT CATTTGGTTT 
601 AGCTAATCTT TACTTGCCAT TGGCCCAATT GCTGTATCAC TTCGACTGGA AACTCCCAAC 
661 TGGAATCAAG CCAAGAGACT TGGACTTGAC TGAATTATCT GGAATAACTA TTGCTAGAAA 
721 GGGTGACCTT TACTTAAATG CCACTCCTTA TCAACCGTCG AGACTAATTT AATATTGGCA 
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781 TCAACTTTTT AAATTTTCTT CATCAACCTC ATTATTAATG TACAATAATC TTTCTTCTGT 
841 TGTTGTAGGC TTTATCGATT TCCAATACAT GTATTCTTTA TTTAAAAATG TATCACATTC 
901 CATGTATAAA AAAAAAAAAA 

SBQ. ID. NO. 480 

1 INEHKKNLAA GKSNGALGGE DLIDVLLRLK NOTSLQFPIT NNSIKAVIVD MFAAGTETSS 
61 TTTVWAMAEM MKNPSVFTKA QAEVREAFRD KVSFDENDVE ELKYLKLVIK ETLRLHPPSP 
121 LLVPRECRED TDINGYTIPA KTKVMVNVWA LGRDPKYWDD AESFKPERFE QCSMDFFGNN 
181 FEFLPFGGGR RICPGMSPGL ANLYLPLAQL LYHFDWKLPT GIKPRDLDLT ELSGITIARK 
241 GDLYLNATPY QPSRLI 

SEQ. ID. NO. 481 

D101A-AE2 

1 AAATAATGGA TTATCATATT TCTTTCCATT TTCAAGCTCT TTTAGGGCTT TTAGCCTTTG 
61 TGTTCTTGTC TATTATCTTA TGGAGAAGAA CACTCACTTC AAGAAAATTA GCCCCTGAAA 
121 TCCCAGGGGC ATGGCCTATT ATAGGCCATC TTCGTCAGCT GAGTGGTACT GATAAGAATA 
181 TCCCATTTCC CCGAATATTG GGCGCTTTGG CAGATAAATA TGGACCTGTC TTCACACTGA 
241 GAATAGGGAT GTACCCCTAT TTGATTGTCA ACAATTGGGA AGCAGCTAAG GATTGTCTCA 
3 01 CAACGCATGA TAAGGACTCC GCTGCCCGAC CAACTTCTAT GGCTGGTGAA AGCATCGGGT 
3 61 ACAAGTATGC GAGGTTTACT TATGCTAATT TTGGTCCTTA TTATAACCAA GTGCGCAAAC 
421 TAGCCCTACA ACATGTACTC TCGAGTACTA AACTCGAGAA AATGAAACAC ATACGTGTTT 
481 CTGAATTGGA AACTAGCATC AAAGAATTAT ATTCTTTGAC GCTGGGCAAA AACAACATGC 
541 AAAAAGTGAA TATAAGTAAA TGGTTTGAAC AATTGACTTT AAACATAATC GTGAAGACAA 
601 TTTGTGGCAA GAGATATAGC AACATAGAGG AGGATGAAGA GGCACAACGT TTCAGAAAGG 
661 CATTTAAGGG CATCATGTTT GTTGTAGGGC AAATTGTTTT ATATGACGCA ATTCCATTCC 
721 CATTGTTCAA ATACTTTGAT TTCCAAGGTC ATATACAATT GATGAACAAA ATTTATAAAG 
7 81 ACTTAGATTC TATTCTTCAA GGATGGTTGG ATGATCATAT GATGAACAAG GATGTAAACA 
841 ATAAGGATCA AGATGCCATA GATGCCATGC TTAAGGTAAC ACAACTTAAT GAATTCAAAG 
901 CCTATGGTTT TTCTCAGGCC ACTGTGATCA AGTCGACAGT CTTGAGTTTG ATCTTAGATG 
961 GAAATGACAC AACCGCTGTT CATTTGATAT GGGTAATGTC CTTATTACTG AACAATCCAC 
1021 ATGTTATGAA ACAAGGCCAA GAAGAGATAG ACATGAAAGT GGGTAAAGAG AGGTGGATTG 
1081 AAGATACTGA CATAAA?^AAT TTAGTGTACC TTCAGGCTAT CGTTAAAGAG ACATTGCGCT 
1141 TGTATCCACC TGTTCCTTTT CTTTTACCAC ACGAAGCAGT GCAAGATTGT AAAGTGACTG 
12 01 GTTACCACAT TCCTAAAGGT ACTCGTCTAT ATATCAATGC GTGGAAAGTA CATCGCGATC 

12 61 CTGAAATTTG GTCAGAGCCC GAAAAGTTTA TGCCCAATAG ATTCTTGACT AGCAAAGCAA 
1321 ATATAGATGC TCGCGGTCAA AATTTTGAAT TTATACCGTT TGGTTCTGGG AGACGGTCAT 

13 81 GTCCAGGGAT AGGT 

SEQ. ID. NO. 482 

1 MDYHISFHFQ ALLGLLAFVF LSIILWRRTL TSRKLAPEIP GAWPIIGHLR QLSGTDKNIP 
61 FPRILGALAD KYGPVFTLRI GMYPYLIVNN WEAAKDCLTT HDKDSAARPT SMAGESIGYK 
121 YARFTYANFG PYYNQVRKLA LQHVLSSTKL EKMKHIRVSE LETSIKELYS LTLGKNNMQK 
181 VNISKWFEQL TLNIIVKTIC GKRYSNIEED EEAQRFRKAF KGIMFWGQI VLYDAIPFPL 
241 FKYFDFQGHI QLMNKIYKDL DSILQGWLDD HMMNKDVNNK DQDAIDAMLK VTQLNEFKAY 
3 01 GFSQATVIKS TVLSLILDGN DTTAVHLIWV MSLLLNNPHV MKQGQEEIDM KVGKERWIED 
3 61 TDIKNLVYLQ AIVKETLRLY PPVPFLLPHE AVQDCKVTGY HIPKGTRLYI NAWKVHRDPE 
421 IWSEPEKFMP NRFLTSKANI DARGQNFEFI PPGSGRRSCP GIG 

SEQ. ID. NO. 483 

D108-AA4 

1 AATAGGCCAA AATATATGGA AGGAGATATT GTTGATCTTT TGCTACAATT GAAGAAAGAA 
61 CAATCAACAC CAATTGATCT TACTTTGGAG GATATAAAAG GAATTCTCAT GAATGTGTTG 
121 GTTGCAGGAT CAGACACTAG TGCAGCTGCT ACTGTTTGGG CAATGACAGC CTTGATAAAA 
181 AATCCAAAAA CCATGAAAAA AGTTCAATCA GAAATCAGAA AATCAATAGG GAAGAAAGGA 
241 ATTGTAAATG AAGAAGATGT CCAA?UVCATG CCTTATCTCA AAGCAGTGAT AAAGGAAATA 
301 TTTAGATTGT ATCCACCAGT TCCACTTCTA GTTCCAAGAG AATCAATGGA AAAAACCATA 
361 TTAGAAGGTT ATGAAATTCG GCCAGGAACC ATAGTTCATG TTAACGCTTG GGCTATTGCG 
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421 AGGGATCCTG AAATATGGGA AAATCCAGAA GAATTTATAC CTGAGAGATT ACTTGAATAG 
SEQ. ID. NO. 484 

1 NRPKYMEGDI VDLLLQLKKE QSTPIDLTLE DIKGILMNVL VAGSDTSAAA TVWAMTALIK 
61 NPKTMKKVQS EIRKSIGKKG IVNEEDVQNM PYLKAVIKEI FRLYPPVPLL VPRESMEKTI 
121 LEGYEIRPGT IVHWAWAIA RDPEIWENPE EFIPERLLE 

SEQ. 10. NO. 485 

D124-AC5(5») 

1 GCAAGTTCTT TCACTAGTTG TAATAGTCCT ATACATTCTT CAACAATTAC ACAACAAATT 
61 CATGAAAAAG AAAAAGAAAC TTCCTCCAGG TCCAAAAGGG TTTCCAATTA TTGGAAATCT 
121 ATTTATGATT GGCAAAAACC TACATCAAGA TCTTTATCAA ATAGCCAAAA AACATGGTCC 
181 TATAATGAGT ATGAGATTTG GTCTAGTTCC TATCATTGTT GCTTCATCTC CTCATGCTGC 
241 TGAACTATTC TTGAAAAAAC ATGATCTTGT TTTTGCTAGT AGACCATATA ATACAGCTGC 
301 TCAATATATT GGATATAATC AAAGAAATCT TACTTTTGGT AAATATGGTC CTTATTGGCG 
3 61 AAATATGGGA AAATTATGTA CGTTAGAATT GCTTAGTAGT CTCAAGATCA ATTCATTTCA 
421 GGCCATGAGA AAACAAGAAA TTGGAAATTT TGTGACTTTT CTCAATAGAG CAGCTTCTAA 
481 TGGTATTGAG GTTGATATTA GTGCTAAACT TGCTTCATTA AGTGCAAATA TGGCTTGTTT 
541 AATGGTATTT GGGAAGAAAT ATATGGATGA AGAATTTGAT GAAAGGGGTT TTAAAGATGT 
601 AATTCAAGAG ACATTAGTTA TAACTGCAAC ACCAAATATT GGTGAGTTTT TTCCTTTTCT 
661 TGATAGGTTT GATTTGCAGG GATTTGTTCC ACGTATGAAA AAATTGGCAA AGATTTTTGA 
721 TGATTTTTTC GAGAAA 

SEQ. ID. NO. 486 

1 QVLSLWIVL YILQQLHNKF >nCKKKKLPPG PKGFPIIGNL FMIGKNLHQD LYQIAKKHGP 
61 IMSMRFGLVP IIVASSPHAA ELFLKKHDLV FASRPYNTAA QYIGYNQRNL TFGKYGPYWR 
121 NMRKLCTLEL LSSLKINSFQ AMRKQEIGNF VTFLNRAASN GIEVDISAKL ASLSANMACL 
181 MVFGKKYMDE EFDERGFKDV IQETLVITAT PNIGEFFPFL DRFDLQGFVP RMKKLAKIFD 
241 DFFEK 

SEQ. ID. NO. 487 

D124-AC5 (3 • ) 

1 CGTGAATCCA TTGAAGATTG TATTGTTGAT GGTTTTGATA TACCTAAAGG TTCAAGAATT 
61 TTAGTAAATA CTTGGGCAAT TGGAAGAGAT CCAGAAGCCT GGCCCGAACC CGAGAAGTTC 
121 AAGCCAGAAA GGTTTGTTGG TAGCAACATC GATCTTAGGG GACGTGATTT TCAACTTTTA 
181 CCATTTGGCT CAGGGAGAAG GAGTTGCCCC GGATTACAAT TAGGGCTCAC CATTGTTCGC 
241 TTGGTGTTAG CGCAATTGGT TCATTGCTTT GATTGGGAAC TACCAAATGG TATGGCGCCA 
301 GAAGATTTAG ATATGACTGA GAAATTTGGT TTAGTTACAG CTAGAGCTCA ACATTTAGTT 
3 61 GCTATTCCTA CTTATCAATT GCATGTGTAG TTGATGTGAA TCTTCCTAGA AGCTTAATTA 
421 AGTAAAAATG AAGATCTTTG TCCTCTATAT TTTTCATTGT ATGACAAAAT TTGGTAACAT 
481 TTACTACATA TGGAGCTACC ACTAAAAAA 

SEQ. ID. NO. 488 

1 RESIEDCIVD GFDIPKGSRI LVNTWAIGRD PEAWPEPEKF KPERFVGSNI DLRGRDFQLL 
61 PFGSGRRSCP GLQLGLTIVR LVLAQLVHCF DWELPNGMAP EDLDMTEKFG LVTARAQHLV 
121 AIPTYQLHV 

SEQ. ID. NO. 489 

D141-AD7 

1 TCGACGACTA TAATTTGGGC ATTAGCTGAA ATGATGAAGA AACCAAGTGT TTTAGCAAAG 
61 GCACAAGCTG AAGTAAGGCA AGCTTTGAAG GAGAAAAAAG GTTTTCAACA GATTGATCTT 
121 GATGAGCTAA AATATCTCAA GTTAGTAATC AAAGAAACCT TAAGAATGCA CCCTCCAATT 
181 CCTCTATTAG TTCCTAGAGA ATGTATGGAG GATACAAAGA TTGATGGTTA CAATATACCT 
241 TTCAAAACAA GAGTCATAGT TAATGCATGG GCAATCGGAC GAGATCCAGA AAGTTGGGAT 
3 01 GACCCCGAAA GCTTTATGCC AGAGAGATTT GAGAATAGTT CTATTGACTT TCTTGGAAAT 
3 61 CATCATCAGT TTATACCATT TGGTGCAGGA AGAAGGATTT GTCCGGGAAT GCTATTTGGT 
421 TTAGCTAATG TTGGACAACC TTTAGCTCAG TTACTTTATC ACTTCGATTG GAAACTCCCT 
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481 AATGGACAAA GTCATGAGAA TTTCGACATG ACTGAGTCAC CTGGAATTTC TG 
SBQ. ID. NO. 490 

1 STTIIWAIiAE MMKKPSVLAK AQAEVRQALK EKKGFQQIDL DELKYLKLVI KETLRMHPPI 
61 PLLVPRECME DTKIDGYNIP FKTRVIVNAW AIGRDPESWD DPESFMPERP ENSSIDFLGN 
121 HHQFIPFGAG RRICPGMLFG liAWGQPLAQ LLYHFDWKLP NGQSHENFDM TESPGIS 



SBQ. ID. NO. 491 

D148-AD1 

1 TGCTCTTTCT ACTCTTTGTA GCCCTTCCTT TCATTCTTAT TTTTCTTCTT CCTAAATTCA 
61 AAAATGGTGG AAATAACAGA TTGCCACCAG GTCCTATAGG TTTACCATTC ATTGGAAATT 
121 TGCATCAATA CGATAGTATA ACTCCTCATA TCTATTTTTG GAAACTTTCA AAAAAATATG 
181 GCAAAATCTT CTCATTAAAA CTTGCTTCTA CTAATGTGGT AGTAGTTTCT TCAGCAAAAT 
241 TAGCAAAAGA AGTATTGAAA AAACAAGATT TAATATTTTG TAGTAGACCA TCTATTCTTG 
3 01 GCCAACAAAA ACTGTCTTAT TATGGTCGTG ATATTGCTTT TAATGATTAT TGGAGAGAAA 
3 61 TGAGAAAAAT TTGTGTTCTT CATCTTTTTA GTTTAAAAAA AGTTCAATTA TTTAGTCCAA 
421 TTCGTGAAGA TGAAGTTTTT AGAATGATTA AGAAAATATC AAAACAAGCT TCTACTTCAC 
481 AAATTATTAA TCTGAGTAAT TTAATGATTT CATTAACAAG TACAATTATT TGTAGAGTTG 
541 CTTTTGGTGT TAGGATTGAA GAAGAAGCAC ATGCAAGGAA GAGATTTGAT TTTCTTTTGG 
601 CCGAGGCACA AGAAATGATG GCTAGTTTCT TTGTATCTGA TTTTTTTCCC TTTTTAAGTT 
661 GGATTGATAA ATTAAGTGGA TTGACATATA GACTTGAGAG GAATTTCAAG GATTTGGATA 
7 21 ATTTTTATGA AGAACTCATT GAGCAACATC AAAATCCTAA TAAGCCAAAA TATATGGAAG 
7 81 GAGATATTGT TGATCTTTTG CTACAATTGA AGAAAGAGAA ATTAACACCA CTTGATCTCA 
841 CTATGGAAGA TATAAAAGGA ATTCTCATGA ATGTGTTAGT TGCAGGATCA GACACTAGTG 
901 CAGCTGCTAC TGTTTGGGCA ATGACAGCCT TGATAAAGAA TCCTAAAGCC ATGGAAAAAG 
9 61 TTCAATTAGA AATCAGAAAA TCAGTTGGGA AGAAAGGCAT TGTAAATGAA GAAGATGTCC 
1021 AAAACATCCC TTATTTTAAA GCAGTGATAA AGGAAATATT TAGATTGTAT CCACCAGCTC 
1081 CACTTTTAGT TCCAAGAGAA TCAATGGAAA AAACCATATT AGAAGGTTAT GAAATTCGGC 
1141 CAAGAACCAT AGTTCATGTT AACGCTTGGG CTATAGCAAG GGATCCTGAA ATATGGGAAA 
1201 ATCCAGATGA ATTTATACCT GAGAGATTTT TGAATAGCAG TATCGATTAC AAGGGTCAAG 

12 61 ATTTTGGGTT ACTTCCATTT GGTGCAGGCA GAAGAGGTTG CCCAGGTATT GCACTTGGGG 

13 21 TTGCATCCAT GGAACTTGCT TTGTCAAATC TTCTTTATGC ATTTGATTGG GAGTTGCCTT 
13 81 ATGGAGTGAA AAAAGAAGAC ATCGACACAA ACGTTAGGCC TGGAATTGCC ATGCACAAGA 
1441 AAAACGAACT TTGCCTTGTC CCAAAAAATT ATTTATAAAT TATATTGGGA CGTGGATCTC 
1501 ATGCTAGTTC TGTGAAGG 



SBQ. ID. NO. 492 

1 LFLLFVALPF ILIFLLPKFK NGGNNRLPPG PIGLPFIGNL HQYDSITPHI YFWKLSKKYG 

61 KIFSLKLAST NVVWSSAKL AKEVLKKQDL IFCSRPSILG QQKLSYYGRD lAFND^Tft/REM 

121 RKICVLHLFS LKKVQLFSPI REDEVFRMIK KISKQASTSQ IINLSNLMIS LTSTIICRVA 

181 FGVRIEEEAH ARKRFDFLLA EAQEMMASFF VSDFFPFLSW IDKLSGLTYR LERNFKDLDN 

241 FYEELIEQHQ NPNKPKYMEG DIVDLLLQLK KEKLTPLDLT MEDIKGILMN VLVAGSDTSA 

301 AATVWAMTAL IKNPKAMEKV QLEIRKSVGK KGIVNEEDVQ NIPYFKAVIK EIFRLYPPAP 

3 61 LLVPRESMEK TILEGYEIRP RTIVHVNAWA lARDPEIWEN PDEFIPERFL NSSIDYKGQD 

421 FGLLPFGAGR RGCPGIALGV ASMELALSNL LYAFDWELPY GVKKEDIDTN VRPGIAMHKK 

481 NELCLVPKNY L 



SBQ. ID. NO. 493 

D212-BC11 

1 CTCATTATCC ATCACCTAAA ATGGAGAATT CTTGGGTTTT TCTAGCCTTG GCAGGGCTAT 
61 CTGCATTAGC TTTTCTCTGT AAAATAATCA CCTGTCGAAG ACCGGTTAAC CGGAAAATAC 
121 CACCAGGTCC AAAACCATGG CCCATCATTG GCAATTTGAA CCTACTTGGT CCTATACCAC 
181 ATCAATCTTT TGACTTGCTT TCCAAAAAAT ATGGAGAGTT GATGCTGCTG AAATTTGGCT 
241 CCAGGCCAGT TCTTGTTGCT TCATCTGCTG AAATGGCAAA ACAGTTTTTA AAAGTACATG 
3 01 ATGCTAATTT CGCCTCCCGT CCTATGCTAG CTGGTGGAAA GTATACAAGC TATAACTATT 
3 61 GTGACATGAC ATGGGCACCC TATGGTCCCT ATTGGCGCCA AGCACGACGA ATTTACCTTA 
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421 ACCAGATATT TACTCCGAAA AGGCTAGACT CGTTCGAGTA CATTCGTGTT GAAGAAAGGC 
481 AGGCCTTGAT TTCCCAGCTG AATTCCCTTG CTGGAAAGCC ATTTTTTCTC AAAGACCATT 
541 TGTCGCGATT TAGCCTCTGC AGCATGACAA GGATGGTTTT GAGCAACAAG TATTTTGGTG 
601 AATCAACAGT TAGAGTAGAA GATTTGCAGT ACCTGGTAGA TCAATGGTTC TTACTTAATG 
661 GTGCTTTCAA CATTGGAGAT TGGATTCCAT GGCTCAGCTT CTTGGACCTA CAAGGCTATG 
721 TGAAACAAAT GAAGGCTTTG AAAAGAACTT TTGATAAGTT CCACAACATT GTGCTAGATG 
781 ATCACAGGGC TAAGAAGAAT GCAGAGAAGA ACTTTGTCCC AAAAGACATG GTTGATGTCT 
841 TGTTGAAGAT GGCTGAAGAT CCTAATCTGG AAGTCAAACT CACTAATGAC TGTGTCAAAG 
901 GGTTAATGCA GGATTTACTA ACTGGAGGAA CAGATAGCTT AACAGCAGCA GTGCAATGGG 
961 CATTTCAAGA ACTTCTTAGA CAGCCAA 

SBQ. ID* NO. 494 

1 MENSWVFLAL AGLSAIiAFLC KIITCRRPVN RKIPPGPKPW PIIGNLNLLG PIPHQSFDLL 

61 SKKYGELMLL KFGSRPVLVA SSAEMAKQFL KVHDANFASR PMLAGGKYTS YNYCDMTWAP 

121 YGPYWRQARR lYLNQIFTPK RLDSFEYIRV EERQALISQL NSLAGKPFFL KDHLSRFSLC 

181 SMTRMVLSNK YFGESTVRVE DLQYLVDQWF LLNGAFNIGD WIPWLSFLDL QGYVKQMKAL 

241 KRTFDKFHNI VLDDHRAKKN AEKNTFVPKDM VDVLLKMAED PNLEVKLTND CVKGLMQDLL 

3 01 TGGTDSLTAA VQWAFQELLR QP 



SEQ. XD. NO. 495 

D217-AB10 

1 ATGATCGCTG GGAAAAATTA TGGATCCGGT AAAGGAGATG AACAAGTGGA GAGATTTGGG 
61 AAAGCGTTTA AGGATTTTAT AATTTTATCA ATGGAGTTTG TGTTATGGGA TGCTTTTCCA 
121 ATTCCATTGT TCAAATGGGT GGATCTTCAA GGCCATGTTA AGGCCATGAA AAGGACATTT 
181 AAGGATATAG ATTCTGTTTT TCAGAATTGG TTAGAGGAAC ATGTCAAGAA AAAAGAAAAA 
241 ATGGAGGTTA ATGCAGAAGG AAATGAACAA GATTTCATTG ATGTGGTGCT TTCAAAAATG 
301 AGTAATGAAT ATCTTGATGA AGGCTACTCT CGTGATACTG TCATAAAAGC AACAGTGTTT 
361 AGTTTAGTCT TGGATGCTGC GGACACAGTT GCTCTTCACA TGAATTGGGG AATGGCATTA 
421 TTGACAAACA ATCAACATGC CTTGAAGAAA GCGCAAGAAG AGATAGATAA AAAAGTTGGT 
481 AAGGATAGAT GGGTAGAAGA GAGTGATATT AAGGATTTGG TATACCTCCA AACTATTGTT 
541 AAAGAAGTGT TACGATTATA TCCACCGGGA CCTTTATTAG TACCCCATGA AAATGTAGAG 
601 GATTGTGTTG TTAGTGGATA TCACATTCCT AAAGGGACTA GACTATTCGC GAACGTTATG 
661 AAATTACAGC GCGATCCTAA ACTCTGGTCA AATCCTGATA AGTTCGATCC AGAGAGATTT 
721 TTCGCTGCTG ATATTGACTT TCGTGGTCAA CACTATGAGT TTATCCCATT TGGTTCTGGA 
781 AGACGATCTT GTCCGGGGAT GACTTATGCA ATGCAAGTGG AACACCTAAC AATCGCACAC 
841 TTGATCCAGG GTTTCAATTA CAAAACTCCA AATGACGAGC CCTTGGATAT GAAGGAAGGT 
901 GCAGGATTAA CTATACGTAA GGTAAATCCT ATAGAAGTGG TAATTACGCC TCGCCTGACA 
961 CCTGAGCTTT ATTAAAATCT 

SSQ. ZD. NO. 496 

1 MIAGKNYGSG KGDEQVERFG KAFKDFIILS MEFVLWDAFP IPLFKWVDLQ GHVKAMKRTF 
61 KDIDSVFQNW LEEHVKKKEK MEVNAEGNEQ DFIDWLSKM SNEYLDEGYS RDTVIKATVF 
121 SliVLDAADTV ALHMNWGMAL LTNNQHALKK AQEEIDKKVG KDRWVEESDI KDLVYLQTIV 
181 KEVLRLYPPG PLLVPHENVE DCWSGYHIP KGTRLFANVM KLQRDPKLWS NPDKFDPERF 
241 FAADIDFRGQ HYEFIPFGSG RRSCPGMTYA MQVEHLTIAH LIQGFNYKTP NDEPLDMKEG 
301 AGLTIRKVNP lEWITPRLT PELY 

SEQ. ID. NO. 497 

D220-BC6 

1 GCAGTGATCT TAGAAGGTCT TAGGCGACAC CCGCCCGGTC ACTTTGTGCT GCCACATACG 
61 GVGACAGAGG AAGCAGAACT GAACGGCTAC GTCGTCCCAA AGAATGCCAC CATCAATTTC 
121 ATGGTTGCCG ACATGGGTTT GGACCCAAAG GTGTGGGAGG ATCCCTTGGA ATTTAAGCCA 
181 GAGAGGTTCT TAATGGAGGG ATCAGATAAG GAAGGGTTTG ATATAACAGG AAGTAGAGAG 
241 ATCAAGATGA TGCCATTTGG AGCTGGTAGG AGAATATGTC CTGGCTATGC TTTGGCTATG 
301 CTTCATTTGG AATACTTTGT GGCTAATTTG GTTTGGCATT TTCGATGGGA TGCTGTGGAG 
361 GGAGATGATG TTGATCTTTC AGAGAAGCTA GAATTCACCG TTGTGATGAA GAATCCTCTA 
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421 CGAGCTCGTA TCTGCCCTAG AGTTAACTCT GTTTGAATTT GGTAATTACT AGTAGTTCTT 
481 TCTATTTGCA TTGTTCCCTG TTGATGGACT TCC 



SBQ. XD. NO. 498 

1 AVILEGLRRH PPGHFVLPHT VTEEAELNGY WPKNATINF MVADMGLDPK VWEDPLEFKP 
61 ERFLMEGSDK EGFDITGSRE IKMMPFGAGR RICPGYALAM LHLEYFVANL VWHFRWDAVE 
121 GDDVDLSEKIi EFTWMKNPL RARICPRVNS V 



SBQ. ID. NO. 499 

D225-AG9 

1 TTCGCATGGA TAGGCCCAGA ACCCAGGATT 

61 ATAGTAACGA ACAACACCAT CTTCAAAAAA 

121 GTTAGTGGCA TCTCAAGTTA TGAGGACGAC 

181 ACTGCATTTT ACGCCGAGAA GTTGAAGTGT 

241 GATATGATCA ACAAGTGGGA AATTCTACTC 

301 CATCCATATT TTGAAGATTT TACCAGTGAT 

361 TATGCAGAAG GAACAAGAAT ATTTCATCTT 



TTCGTAATGA AACCAGAATT GATAAAGGAA 
CCAAAACCAG CCCCACTTGT CCAGCTTCTT 
AAATGGGCTA AGCACAGAAA AATTCTTAAC 
ATGCTGCCGG CAATGCACAC AAGCTGTGAA 
TCCGAAAACA AATCCTGCGA ATTGGACGTG 
GTGATTTCAA GAACAGCATT TGGAAGTAGT 
CAAAAAGAAC TAGCTGAACT CACACGCCA 



SBQ. X0. NO. 500 

1 FAWIGPEPRI FVMKPELIKE IVTNNTIFKK PKPAPLVQLL VSGISSYEDD KWAKHRKILN 
61 TAFYAEKLKC MLPAMHTSCE DMINKWEILL SENKSCELDV HPYFEDFTSD VISRTAFGSS 
121 YAEGTRIFHL QKELAELTR 



SBQ. ID. NO. 501 

D231-AF1 

1 ATGAGTTTAG TAAAATCAGT AGTGTATGAA 

61 CAATATGGTA AAGCCAAAGA AGATATCATA 

.121 AAGAAAGGTG AAATGATCTT TGGATATCAG 

181 GACAAACCAG AGGAGTTTAT TCCGGAGAGG 

241 TATGTGTATT GGTCAAATGC AAGAGAGACA 

3 01 CCAGCGAAAA ATCTTGTCGT GCTTTTGTGC 

361 TACGACACAT TCACAGTGGA GTCAACAAAG 

421 ACTCTGGAAA AAGCGACATG AGTTTCAGAT 

481 GGTCATTCTG CAAATTATTG TACTTGTGCT 

541 TGCACTGTTT TTAGAAA 



GTATTAAGAA TTGAACCTCC AGTTCCATTC 
ATCCAAAGCC ATGATTCT^C TTTCTTAGTC 
CCTTTTGCTA CAAAAGATCC AAAGATTTTT 
TTCATGGCCG AAGGGGAAAA ATTATTAAAG 
GATGATCCAA CGGTGGACGA CAAACAATGC 
AGGTTGATGT TGGTGGAGGT TTTCATGCGT 
CTCTTTCTTG GGTCATCAGT AACGTTCACG 
ATCTTAATTG TAGGCTGCAA ATAATAATGT 
GATGTACTTG ACTTCGAGTG GATATAATAA 



SBQ. XD. NO. 502 

1 MSLVKSWYE VLRIEPPVPF QYGKAKEDII IQSHDSTFLV KKGEMIFGYQ PFATKDPKIF 
61 DKPEEFIPER FMAEGEKLLK YVYWSNARET DDPTVDDKQC PAKNLWLLC RLMLVEVFMR 
121 YDTFTVESTK LFLGSSVTFT TLEKAT 



SBQ. ID. NO. 503 

D232-AH5 

1 GTTGGAACAA GTAGTTGGCA CAAACAGAAT GGTGGAAGAA TCAGATTTGG AAAAATTAGA 
61 TTACTTAGAT ATGGTTGTAA AAGAAGGTTT TAGGCTTCAC CCTGTTGCAC CACTATTACT 
121 TCCTCGTGAA TCCATTGAAG ATTGTATTGT TGATGGTTTT GATATACCTA AAGGTTCAAG 
181 AATTTTAGTA AATACTTGGG CAATTGGAAG AGATCCAGAA GCCTGGCCCG AACCCGAGAA 
241 GTTCAAGCCA GAAAGGTTTG TTGGTAGCAA CATCGATCTT AGGGGACGTG ATTTTCAACT 
3 01 TTTACCATTT GGCTCAGGGA GAAGGAGTTG CCCCGGATTA CAATTAGGGC TCACCATTGT 
3 61 TCGCTTGGTG TTAGCGCAAT TGGTTCATT 



SBQ. ID. NO. 504 

1 LEQWGTNRM VEESDLEKLD YLDMWKEGF RLHPVAPLLL PRESIEDCIV DGFDIPKGSR 
61 ILVNTWAIGR DPEAWPEPEK FKPERFVGSN IDLRGRDFQL LPFGSGRRSC PGLQLGLTIV 
121 RLVLAQLVH 
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SBQ. XD. NO. 505 

D240-BB8 

1 CTATATGGAG ACACAAAGGA GATGGCTGAG ATGACCAAAG AAGCCAAGTT TAAACCCATT 

61 AAACTCACTG ATGATATTCT CCCTCGGATC TTCCCTTTCT ACCATCATAC TTTCAACAAA 

121 TATGGTAACC ATTGTTTCGC ATGGATAGGC CCAGAACCCA GGATTTTCGT AATGAAACCA 

181 GAATTGATAA AGGAAATAGT AACGAACAAC ACCATCTTCA AAAAACCAAA ACCAGCCCCA 

241 CTTGTCCAGC TTCTTGTTAG TGGCATCTCA AGTTATGAGG ACGACAAATG GGCTAAGCAC 

3 01 GGAAAAATTC TTAACACTGC ATTTTACGCC GAGAAGTTGA AGTGTATGCT GCCGGCAATG 

3 61 CACACAAGCT GTGAAGATAT GATCAACAAG TGGGAAATTC TACTC 

SBQ. 10. NO. 506 

1 LYGDTKEMAE MTKEAKFKPI KLTDDILPRI FPFYHHTFNK YGNHCFAWIG PEPRIFVMKP 

61 ELIKEIVTNN TIFKKPKPAP LVQLLVSGIS SYEDDKWAKH GKILNTAFYA EKLKCMLPAM 
121 HTSCEDMINK WEILL 



SEQ. ID. NO. 507 

D280-AA6 

1 TACGTACGTT ACATTTACCC ATGCAGTAAC GGAGCCAACA ACATTGGGTG GGTATGACAT 

61 ACCCACATAT GCTAATGTAG AGTTTTTTGT ACCCGGGATC TCGGATGACC CGAAAGTTTG 

121 GTCTGATCCG GAAAAGTTTG ACCCGGATAG GTTTCTATCC GGGCGGGAGG ACGCTGATAT 

181 AACGGGTGTG ACCGGGGTAA AGATGATGCC ATTTGGGGTC GGGCGGAGGA TTTGTCCGGG 

241 CTTGGGCTTG GCAACGGTGC ATGTGAATTT GATGTTGGCC CGAATGATTC AAGAATTTGA 

301 ATGGTCCGCT TACCCGGAAA ATAGGAAAGT GGATTTTACT GAGAAATTGG AATTTACTGT 

361 GGTGATGAAA AATCCTTTAA GAGCTAAGGT CAAGCCAAGA ATGCAAGTGG TGTAATTCAT 
421 TAAGATTATA AGTCCAAAAA TAAGC 



SBQ. ID. NO. 508 

1 TYVTFTHAVT EPTTLGGYDI PTYANVEFFV PGISDDPKVW SDPEKFDPDR FLSGREDADI 
61 TGVTGVKMMP FGVGRRICPG LGLATVHVNL MLARMIQEFE WSAYPENRKV DFTEKLEFTV 
121 VMKNPLRAKV KPRMQW 

SEQ. ID. NO. 509 

D285-AD7 

1 AGAAGAGATA GACATGAAAG TGGGTAAAGA GAGGTGGATT GAAGATACTG ACATAAAAAA 
61 TTTAGTGTAC CTTCAGGCTA TCGTTAAAGA GACATTGCGC TTGTATCCAC CTGTTCCTTT 
121 TCTTTTACCA CACGAAGCAG TGCAAGATTG TAAAGTGACT GGTTACCACA TTCCTAAAGG 
181 TACTCGTCTA TATATCAATG CGTGGAAAGT ACATCGCGAT CCTGAAATTT GGTCAGAGCC 
241 CGAAAAGTTT ATGCCCAATA GATTCTTGAC TAGCAAAGCA AATATAGATG CTCGCGGTCA 
3 01 AAATTTTGAA TTTATACCGT TTGGTTCTGG GAGACGGTCA TGTCCAGGGA TAGGTTTTGC 
361 GACTTTAGTG ACACATCTGA CTTTTGGTCG CTTGCTTCAA GGTTTTGATT TTAGTAAGCC 
421 ATCAAACACG CCAATTGACA TGACAGAAGG CGTAGGCGTT ACTTTGCCTA AGGTTAATCA 
481 AGTTGAAGTT CTAATTATCC CTCGTTTACC TTCTAAGCTT TATTTATTTT GAAAGTGCAA 
541 ATCATCAATC ATGGGTTGAG TAATTAGTGA TACT 

SBQ. ID. NO. 510 

1 EEIDMKVGKE RWIEDTDIKN LVYLQAIVKE TLRLYPPVPF LLPHEAVQDC KVTGYHIPKG 
61 TRLYINAWKV HRDPEIWSEP EKFMPNRFLT SKANIDARGQ NFEFIPFGSG RRSCPGIGFA 
121 TLVTHLTFGR LLQGFDFSKP SNTPIDMTEG VGVTLPKVNQ VEVLIIPRLP SKLYLF 

SEQ. ID. NO. 511 

D285-AH9 

1 AACATTTGGC AATATAGTCT TCCTCCTCAG TTCTTGCCTC CTGTTCCTTA GAAATAATGG 
61 ATTATCATAT TTCTTTCCAT CTTCCAACTC TTTTTGGGCT TTTTGCCTTT GTGTTCTTGT 
121 CTATTATCTT ATGGAGAAGA ACACTCAATT CAAGAAGATT AGCCCCTGAA ATCCCAGGGG 
181 CATGGCCTAT TATAGGCCAT CTTCGTCAGC TGAGTGGTAC TGATAAGAAT ATCCCATTTC 
241 CCCGAATATT GGGCGCTTTG GCAGATAAAT ATGGACCTGT CTTCACACTG AGAATAGGGA 
301 TGTACCCCTA TTTGATTGTC AACAATTGGG AAGCAGCTAA GGATTGTCTC ACAACGCATG 
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361 ATAAGGACTT CGCTGCCCGA CCAACTTCTA TGGCTGGTGA AAGCATCGGG TACAAGTATG 
421 CGAGGTTTAC TTATGCTAAT TTTGGTCCTT ATTATAACCA AGTGCGCAAA CTAGCCCTAC 
481 AACATGTACT CTCGAGTACT AAACTCGAGA AAATGAAACA CATACGTGTT TCTGAATTGG 
541 AAACTAGCAT CAAAGAATTA TATTCTTTGA CGCTGGGCAA AAACAACATG CAAAAAGTGA 
601 ATATAAGTAA ATGGTTTGAA CAATTGACTT TAAACATAAT CGTGAAGACA ATTTGTGGCA 
661 AGAGATATAG CAACATAGAG GAGGATGAAG AGGCACAACG TTTCAGAAAG GCATTTAAGG 
721 GCATCATGTT TGTTGTAGGG CAAATTGTTT TATATGACGC AATTCCATTC CCATTGTTCA 
781 AATACTTTGA TTTCCAAGGT CATATACAAT TGATGAACAA AATTTATAAA GACTTAGATT 
841 CTATTCTTCA AGGATGGTTG GATGATCATA TGATGAACAA GGATGTAAAC AATAAGGATC 
901 AAGATGCCAT AGATGCCATG CTTAAGGTAA CACAACTTAA TGAATTCAAA GCCTATGGTT 
961 TTTCTCAGGC CACTGTGATC AAGTCGACAG TCTTGAGTTT GATCTTAGAT GGAAATGACA 
1021 CAACCGCTGT TCATTTGATA TGGGTAATGT CCTTATTACT GAACAATCCA CATGTTATGA 
1081 AACAAGGCCA AGAAGAGATA GACATGAAAG TGGGTAAAGA GAGGTGGATT GAAGATA 

SBQ. ZD. NO. 512 

1 MDYHISFHLP TLFGLFAFVF LSIILWRRTL NSRRLAPEIP GAWPIIGHLR QLSGTDKNIP 
61 FPRILGAIiAD KYGPVFTLRI GMYPYLIVNN WRAAKDCLTT HDKDFAARPT SMAGESIGYK 
121 YARFTYANFG PYYNQVRKLA LQHVLSSTKIi EKMKHIRVSE LETSIKELYS LTLGKNNMQK 
181 VNISKWFEQI, TLNIIVKTIC GKRYSNIEED EEAQRFRKAF KGIMFWGQI VLYDAIPFPL 
241 FKYFDFQGHI QLMNKIYKDL DSILQGWLDD HMMNKDVNNK DQDAIDAMLK VTQLNEFKAY 
3 01 GFSQATVIKS TVLSLILDGN DTTAVHLIWV MSLLLNNPHV MKQGQEEIDM KVGKERWIED 

SBQ. ID* NO. 513 

D99-AB3 

1 GTAAATGAAG AAGATGTCCA AAACATCCCT TATTTTAAAG CAGTGATAAA GGAAATATTT 
61 AGATTGTATC CACCAGCTCC ACTTTTAGTT CCAAGAGAAT CAATGGAAAA AACCATATTA 
121 GAAGGTTATG AAATTCGGCC AAGAGCCATA GTTCATGTTA ACGCTTGGGC TATAGCAAGG 
181 GATCCTGAAA TATGGGAAAA TCCAGATGAA TTTATACCTG AGAGATTTTT GAATAGCAGT 
241 ATCGATTACA GGGT 

SEQ* ID. NO. 514 

1 VNEEDVQNIP YFKAVIKEIF RLYPPAPLLV PRESMEKTIL EGYEIRPRAI VHVNAWAIAR 
61 DPEIWENPDE FIPERFLNSS IDYR 

SEQ. ZD. NO. 515 

D99-AC2 

1 AAGAGTAATG GTGCATTAGG AGGTGAAGAT CTAATTGATG TCCTACTAAG ACTTAAGAAT 

61 GATACAAGTC TTCAATTTCC CATCACCAAC AACAATATCA AAGCTGTTAT TGTTGACATG 

121 TTTGCTGCCG GAACGGAAAC TTCATCAACA ACAACTGTAT GGGCTATGGC TGAAATGATG 

181 AAGAATCCAA GTGTATTCAC CAAAGCTCAA GCAGAAGTGC GAGAAGCCTT TAGGGACAAA 

241 GTATCTTTTG ATGAAAACGA TGTGGAGGAG CTGAAATACT TAAAGTTAGT CATTAAAGAA 

3 01 ACTTTGAGAC TTCATCCACC CTCTCCACTT TTGGTCCCAA GAGAATGCAG GGAAGACACA 

3 61 GATATAAACG GCTACACTAT TCCTGCGAAG ACCAAAGTTA TGGTTAATGT TTGGGCATTG 

421 GGAAGAGATC CAAAATATTG GGATGACGCC GAAAGCTTTA AGCCAGAGAG ATTTGAGCAA 
481 TGCTCTATGG ATTTTTTTGG TAATAATTTT GAGTTTCTT 

SBQ. ZD. NO. 516 

1 KSNGALGGED LIDVLLRLKN DTSLQFPITN NNIKAVIVDM FAAGTETSST TTVWAMAEMM 

61 KNPSVFTKAQ AEVREAFRDK VSFDEXTOVEE LKYLKLVIKE TLRLHPPSPL LVPRECREDT 

121 DINGYTIPAK TKVMVNVWAIi GRDPKYWDDA ESFKPERFEQ CSMDFFGNNF EFLP 

SEQ. ZD. NO. 517 

D99-AF11 

1 TTAAACCACC CTGAAACTCT GAAGAAAGCA CAAGCTGAAA TTGATGAACA TATAGGACAT 
61 GAACGTTTAG TGGACGAGTC GGACATCAAC AACCTACCTT ACCTACGTTG TATAATCAAC 
121 GAGACATTCC GAATGTACCC TGCAGGACCA CTACTAGTCC CACACGAGTC GTCAGAGGAA 
181 ACCACCGTAG GAGGCTACCG TGTACCCGGA GGAACCATGT TACTTGTGAA TTTGTGGGCT 
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241 ATTCACAATG ATCCAAAGCT ATGGGATGAA CCAAGAAAGT TTAAGCCAGA AAGATTTGAA 
3 01 GGACTAGAAG GTGTTAGAGA TGGTTACAAA ATGATGCCCT 

SBQ. XD. NO. 518 

1 LNHPETLKKA QAEIDEHIGH ERLVDESDIN NLPYLRCIIN ETFRMYPAGP LLVPHESSEE 
61 TTVGGYRVPG GTMLLVNLWA IHNDPKLWDE PRKPKPERFE GLEGVRDGYK MMP 

SSQ. ID. NO. 519 

D99-AH4 

1 GTCAGGTTTC ATCCTATAGC TCCAGTATTA GCACCTAGAG AATCAAGGGA AGAGTGTGAG 
61 ATTAATGGCT ATGTTATACC AAAAGGCACA ATGGCCCTTG TGAATTTTTG GGCAATTTCT 
121 AGGGATCCAA ATTATTGGTC AAATCCTGAA ACATTTGATC CAGAGAGATT TAATGAAAGT 
181 CACCTTGATT TTACTGGAGC TCATTTCGAG TTTACGCC 

SBQ. ID. NO. 520 

1 VRFHPIAPVIi APRESREECE INGYVIPKGT MALVNFWAIS RDPNYWSNPE TFDPERFNES 
61 HLDFTGAHFE FT 

SEQ. ID. NO. 521 

D99-AH7 

1 GTTATTTGGG CATTAGCAGA AATGATAAAG AATCCAAGTG TAATGGCGAA AGCACAAGCA 
61 GAAGTGAGAG AAGCTTTTAA AGGAAAGAAA ACATGTGATG AGGATACTGA TCTTGAAAAG 
121 CTTAGTTACC TAAATTTAGT GATCAAAGAG ACACTCCGAT TACACCCTCC AACTCCTCTA 
181 CTTGTCCCGC GAGAATGCAG GGAGGAAACA GAGATTGAAG GATTCACTAT ACCGTTGAAA 
241 AGCAAAGTCT TGGTTAACGT ATGGGCAATT GGAAGAGATC CCGAGAATTG GGGAAATCCT 
301 GAATGTTTTA TACCAGAGAG ATTCGAAAAT AGTTCTATTG AGTTTACTGG AAATCATTTT 
3 61 CAACTTCTT 

SEQ. ID. NO. 522 

1 VIWALAEMIK NPSVMAKAQA EVREAPKGKK TCDEDTDLEK LSYLNLVIKE TLRLHPPTPL 
61 LVPRECREET EIEGFTIPLK SKVLVNVWAI GRDPENWGNP ECFIPERFEN SSIEFTGNHF 
121 QLL 

SEQ. ID. NO. S23 

D99-DB4 

1 GAATATCTTG GTGAAGGTTA CTCTCGTGAT ACTGTCATTA AAGCAACGGT GTTTAGTTTG 
61 GTCTTGGATG CAGCAGACAC AGTTGCTCTT CACATAAATT GGGGAATGGC ATTATTGATA 
121 AACAATCAAA AGGCCTTGAC GAAAGCACAA GAAGAGATAG ACACAAAAGT TGGTAAGGAC 
181 AGATGGGTAG AAGAGAGTGA TATTAAGGAT TTGGTATACC TCCAAGCTAT TGTTAAAGAA 
241 GTGTTACGAT TATATCCACC AGGACCTTTG TTAGTACCAC ACGAAAATGT AGAAGATTGT 
3 01 GTTGTTAGTG GATATCACAT TCCTAAAGGG ACAAGATTAT TCGCAAACGT CATGAAACTG 
3 61 CAACGTGATC CTAAACTCTG GTCTGATCCT GATACTTTCG ATCCAGAGAG ATTCATTGCT 
421 ACTGATATTG ACTTTCGTGG TCAGTACTAT AAGTATATCC CO 

SEQ. ID. NO. 524 

61 RWVEESDIKD LVYLQAIVKE VLRLYPPGPL LVPHENVEDC WSGYHIPKG TRLFANVMKL 
121 QRDPKLWSDP DTFDPERFIA TDIDFRGQYY KYI 
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SEQ. ID. no. 525 

D99-DG4 

1 CTTATGGTGG ATTTATTCAT TGCTGGAAGT GACACATCTG CTATAACAAC AGAATGGGCA 
61 ATGGCAGAAC TTCTTCGAAA ACCTCAAGTA CTTAACAAAG TAAGAGAAGA AATACTTCAA 
121 CAAATAGGCA CAGAAAGACC AGTGAAAGAA TCAGACATTG AGAAACTTCC ATACCTTCAA 
181 GCAGTTGTAA AAGAAGCAAT GAGACTTCAT CCGGCAGTTT CATTACTCTT GCCACACAAA 
241 GCCCAAAATG ATATACAAGT GTTGGGCTAC ACTGTGCCTA AGAACACTCA AGTTCTCGTG 
301 AATGCTTGGG CAATTGGAAG AGATCCAAAA TCCTGGGATA AGCCACTGGA GTTTATGCCT 
3 61 GAAAGATTCA TAAAGTCTAG TGTGGATTAC AAAGGTAGGG ACTTTGAGTT TATACCC 

SEQ. ID. UO. 526 

1 LMVDLFIAGS DTSAITTEWA MAELLRKPQV LNKVREEILQ QIGTERPVKE SDIEKLPYLQ 
61 AWKEAMRLH PAVSLLLPHK AQNDIQVLGY TVPKNTQVLV NAWAIGRDPK SWDKPLEFMP 
121 ERFIKSSVDY KGRDFEFIP 

SEQ. ID. no. 527 

D40-2 

1 CACATGAAAA TGTAAAGGAT TGTGTTGTTA GTGGATATCA CATTCCTAAA GGGACTAGAT 
61 TATTCGCAAA CGTCATGAAA CTGCAGCGCG ATCCTTJ^CT CTTGTCAAAT CCTGATAAGT 
121 TCGATCCAGA GAGATTCATC GCTGGTGATA TTGACTTCCG TGGTCACCAC TATGAGTTTA 
181 TCCCATTTGG TTCTGGAAGA CGATCTTGTC CGGGGATGAC TTATGCATTG CAAGTGGAAC 
241 ACCTAACAAT GGCACATTTA ATCCAGGGTT TCAATTACAA AACTCCAAAT GACGAGGCCT 
3 01 TGGATATGAA GGAAGGTGCA GGCATAACAA TACGTAAGGT AAATCCAGTG GAATTGATAA 
361 TAACGCCTCG CTTGGTACCT GAGCTTTACT AAAACCTAAG ATCTTTCATC TTGGTTGATC 
421 ATTGGTTAAT ACTCCTAGAT GGGTATTCAT TTACCTTTTT TCAATTAATT GCATGTCCAG 
481 CTTTTTTAAT TTGGTATATT T 

SEQ. ID. NO. 528 

1 HENVKDCWS GYHIPKGTKL FANVMKLQRD PKLLSNPDKF DPERFIAGDI DFRGHHYEFI 
61 PFGSGRRSCP GMTYALQVEH LTMAHLIQGF NYKTPNDEAL DMKEGAGITI RKVNPVELII 
121 TPRIiVPELY 

SEQ. ID. NO. 529 

D301-EE11 

1 TATAAGTATA TCCCGTTTGG TTCTGGAAGA CGATCTTGTC CAGGGATGAC TTATGCATTG 
61 CAAGTGGAAC ACTTAACAAT GGCACATTTG ATCCAAGGTT TCAATTACAG AACTCCAAAT 
121 GACGAGGCCT TGGGTATGAA GGAAGGTGCA GGCATAACTA TACGTAAGGT AAATCCTGTG 
181 GAACTGATAA TAGCGCCTCG CCTGGCACCT GAGCTTTATT AAAACCTAAG ATCTTTCATC 
241 TTGGTTGATC ATTGTATAAT ACTCCTAAAT GGATATTCAT TTACCTTTTA TCAATTAATT 
301 GTCA 

SEQ. ID. NO. 530 

1 YKYIPFGSGR RSCPGMTYAL QVEHLTMAHL IQGFNYRTPN DEPLGMKEGA GITIRKVNPV 
61 ELIIAPRLAP ELY 

SEQ. ID. NO. 531 

D3 02-AE10 

1 TTTTAAAGAT ATAGATTCTG TTTTTCAGAA TTGGTTAGAG GAACGTATTA ATAAAAGAGA 
61 AAAAATGGAG GTTAATGCAG AAGGGAATGA ACAAGATTTC ATTGATGTGG TGCTTTCAAA 
121 AATGAGTAAT GAATATCTTG GTGAAGGTTA CTCTCGTGAT ACTGTTATTA AAGCAACGGT 
181 GTTTAGTTTG GTCTTGGATG CAGCAGACAC AGTTGCTCTT CACATAAATT GGGGAATGGC 
241 ATTATTGATA AACAATCAAA ATGCCTTGAT GAAAGCACAA GAAGAGATAG ACACAAAAGT 
301 TGGTAAGGAT AGATGGGTAG AAGAGAGTGA TATTAAGGAT TTAGTATACC TCCAAGCTAT 
3 61 TGTTAAAAAG GTGTTACGAT TATATCCACC AGGACCTTTG TTA 
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SBQ. ID. no. 532 

1 FKDIDSVFQN WLEERINKRE KMEVNAEGNE QDFIDWLSK MSNEYLGEGY SRDTVIKATV 
61 FSLVLDAADT VALHINWGMA LLINNQNALM KAQEEIDTKV GKDRWVEESD IKDLVYLQAI 
121 VKKVLRLYPP GPLL 

SEQ. ID. MO. 533 

D303-AC6 

1 TATAAGTATA TCCCGTTTGG TTCTGGAAGA CGATCTTGTC CAGGGATGAC TTATGCATTG 

61 CAAGTGGAAC ACTTAACAAT GGCACATTTG ATCCAAGGTT TCGATTACAG AACTCCAAAT 

121 GACGAGCCCT TGGATATGAA GGAAGGTGCA GGCATAACTA TACGTAAGGT AAATCCTGTG 

181 GAACTGATAA TAGCGCCTCG CCTGGCACCT GAGCTTTATT AAAACCTAAG ATCTTTCATC 

241 TTGGTTGATC ATTGTATAAT ACTCCTAAAT GGATATTCAT TTACCTTTTA TCAATTAATT 

301 GTCA 

SBQ. ID. NO. 534 

1 YKYIPFGSGR RSCPGMTYAL QVEHLTMAHL IQGFDYRTPN DEPLDMKEGA GITIRKVNPV 
61 ELIIAPRLAP ELY 

SBQ. ID. no, 535 

D303-AC11 

1 GAGATTCATT GCTACTGATA TTGACTTTCG TGGTCAGTAC TATAAGTATA TCCCGTTTGG 
61 TTCTGGAAGG CGATCTTGTC CAGGGATGAC TTATGCATTG CAAGTGGAAC ACTTAACAAT 
121 GGCACATTTG ATCCAAGGTT TCAATTACAG AACTCCAAAT GACGAGCCCT TGGATATGAA 
181 GGAAGGTGCA GGCATAACTA TACGTAATGT AAATCCTGTG GAACTGATAA TAGCGCCTCG 
241 CCTGGCACCT GAGCTTTATT AAAACCTAAG ATCTTTCATC TTGGTTGATC ATTGTATAAT 
3 01 ACTCCTAAAT GGATATTCAT TTACCTTTTA TCAATTAATT GTCA 

SBQ. ID. NO. 536 



1 RFIATDIDFR 
61 EGAGITIRNV 



GQYYKYIPFG SGRRSCPGMT YALQVEHLTM 
NPVELIIAPR liAPELY 



AHLIQGFNYR 



TPNDEPLDMK 
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